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Preface 



This text grew out of notes I have used in teaching a one quarter course on integration 
at the advanced undergraduate leveL My intent is to introduce the Lebesgue integral 
in a quick, and hopefully painless, way and then go on to investigate the standard 
convergence theorems and a brief introduction to the Hilbert space of functions 
on the interval. 

The actual construction of Lebesgue measure and proofs of its key properties 
are relegated to an appendix. Instead the text introduces Lebesgue measure as a 
generalization of the concept of length and motivates its key properties: monotonicity, 
countable additivity, and translation invariance. This also motivates the concept of 
cr-algebra. If a generalization of length has these properties then to make sense it 
should be defined on a cx-algebra. 

The text introduces null sets (sets of measure zero) and shows that any general- 
ization of length satisfying monotonicity must assign zero to them. We then define 
Lebesgue measurable sets to be sets in the a-algebra generated by Borel sets and null 
sets. 

At this point we state a theorem which asserts that Lebesgue measure exists and 
is unique, i.e. there is a function fi defined for measurable subsets of a closed interval 
which satisfies monotonicity, countable additivity, and translation invariance. 

The proof of this theorem (Theorem (12.4.21) ) is included in an appendix where it is 
also shown that the more common definition of measurable sets using outer measure 
is equivalent to being in the a-algebra generated by Borel sets and null sets. 

The text presupposes a background which a student obtain from an undergraduate 
course in real analysis. Chapter summarizes these prerequisites with many proofs 
and some references. Chapter 1 gives a brief treatment of the "regulated integral" 
(as found in Dieudonne |T]) and the Riemann integral in a way that permits draw- 
ing parallels with the presentation of the Lebesgue integral in subsequent chapters. 
Chapter 2 introduces Lebesgue measure in the way described above. 

Chapter 3 discusses bounded Lebesgue measurable functions and their Lebesgue 



i 



ii 

integral, while Chapter 4 considers unbounded functions and some of the standard 
convergence theorems. In Chapter 5 we consider the Hilbert space of functions on 
[—1,1] and show several elementary properties leading up to a definition of Fourier 
series. 

In Appendix A we construct Lebesgue measure and prove it has the properties 
cited in Chapter 2. Finally in Appendix B we construct a non- measurable set. 

This work is licensed under a Creative Commons Attribution-Noncommercial-Share 
Alike 3.0 United States License. 
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Chapter 



Background and 



Foundations 



0.1 The Completeness 



of R 



This chapter gives a very terse summary of the properties of the real numbers which 
we will use throughout the text. It is intended as a review and reference for standard 
facts about the real numbers rather than an introduction to these concepts. 

Notation. We will denote the set of real numbers by M., the rational numbers by Q, 
the integers by Z and the natural numbers by N. 

In addition to the standard properties of being an ordered field (i.e. the properties 
of arithmetic) the real numbers M satisfy a property which makes analysis as opposed 
to algebra possible. 

The Completeness Axiom. Suppose A and B are non-empty subsets of M such 
that X < y for every x & A and every y E B. Then there exists at least one real 
number z such that x < z for all x & A and z < y for all y E B. 

Example 0.1.1. The rational numbers, Q, fail to satisfy this property. If A = 
{x \ x"^ < 2} and B = {y \ y > and y^ > 2}, then there is no 2; G Q such that x < z 
for all X e A and z < y for all y E B. 

Definition 0.1.2 (Infimum, Supremum). // ^4 C M, then b E M. is called an upper 
bound for A if b > x for all x E A. The number (5 is called the least upper bound 
or supremum of the set A if (5 is an upper bound and (3 < b for every upper bound 
b of A. A number a e R is called a lower bound for A if a < x for all x E A. The 
number a is called the greatest lower bound or infimum of the set A if a is a lower 
bound and a> a for every lower bound a of A. 
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Theorem 0.1.3. If a non-empty set A C M has an upper bound, then it has a unique 
supremum (3. If A has a lower hound, then it has a unique infimum a. 

Proof. Let B denote the non-empty set of upper bounds for A. Then x < y for every 
X E A and every y E B. The Completeness Axiom teUs us there is a /3 such that 
X < (3 <y ioT every x E A and every y E B. This imphes that P is an upper bound 
of A and that (3 < y for every upper bound y. Hence /5 is a supremum or least upper 
bound of A. It is unique, because any /?' with the same properties must satisfy P < P' 
(since /3 is a least upper bound) and P' < P (since P' is a least upper bound). This, 
of course implies P = P' . 

The proof for the infimum is similar. □ 

We will denote the supremum of a set A by sup A and the infimum by inf A. 

Proposition 0.1.4. If A has an upper hound and P = sup A, then for any e > 
there is an x E A with P — e < x < P . Moreover P is the only upper hound for A with 
this property. If A has a lower hound its infimum satisfies the analogous property. 

Proof. U P = sup A and there is no x G (/3 — e, P), then every x E A satisfies x < P — e. 
It follows that /5 — e is an upper bound for A and is smaller than P contradicting the 
definition of P as the least upper bound. Hence there must be an x G A with 
X E {P - e,P). 

\i P' ^ P is another upper bound for A, then P' > p. There is no x G A with 
X E {P,P'], since such an x would be greater than P and hence P would not be an 
upper bound for A. 

The proof for the infimum is similar. □ 

0.2 Sequences in R 

There are a number of equivalent formulations we could have chosen for the Com- 
pleteness Axiom. For example, we could have take Theorem fl0.1.3p as an axiom and 
then proved the Completeness Axiom as a theorem following from this axiom. In this 
section we prove several more theorems which we will derive from the Completeness 
Axiom, but which are in fact equivalent to it in the sense that if we assumed any one 
as an axiom we could prove the others as consequences. Results of this type include 
Theorem fRT^ . Corollary flUXSj) . and Theorem fRT^ . 
We recall the definition of limit of a sequence. 
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Definition 0.2.1. Suppose is a sequence in M and L G M. We say 

lim Xn = L 

n— >oo 

provided for every e > there exists G N such that 
for all m > N. 

Let {xn}'^=i be a sequence in M. We will say it is monotone increasing if Xn+i > 
for all n and monotone decreasing if < x„ for all n. 

Theorem 0.2.2. // {xn]'^=i is a hounded monotone sequence then lim Xn exists. 

n— »oo 

Proof. If {xn\'^=i is a bounded monotone increasing sequence, let L = sup{x„}^]^. 
Given any e > there is an such that L — e<XN<Lhj Proposition fl0.1.4p . For 
any n > N we have xn Xn ^ L and hence |L — x„| < e. Thus lim x„ = L. 

If {xn}'^=i is a monotone decreasing sequence, then {—Xn}'^=i is increasing and 
lim Xn = — lim — x„. □ 

n— »oo n— »oo 

Corollary 0.2.3. // {xn}'^=i is a bounded sequence then 

lim sup{x„}^^ and lim inf{a;„} 



oo 

n=m 



both exist. We will denote them by lim sup x„ and liminfx„ respectively. The se- 
quence {xn}'^=i has limit L, i.e., limx„ = L, if and only if 

lim inf Xn = lim sup Xn = L. 

n ^oo n — >oo 

Proof. If ym = sup{x„}^^, then {ym}^=i is a monotone decreasing sequence, so 
lim ym exists. The proof that lim inf x„ exists is similar. 

m— »oo 

The fact that inf{x„}^„ < a;^ < sup{x„}^^ implies that if 

lim inf x„ = lim sup x„ = L 

then lim x„ exists and equals L. □ 
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Definition 0.2.4 (Cauchy Sequence). A sequence {xn}^=i is called a Cauchy se- 
quence if for every e > there is an N > (depending on e) such that \xn — Xm\ < e 
for all n,m > N. 

Theorem 0.2.5 (Cauchy Sequences Have Limits). If {xn}'^=i is a Cauchy sequence, 
then lim Xn exists. 

n— »oo 

Proof. First we show that if {xn}'i^=i is a Cauchy sequence, then it is bounded. For 
e = 1 there is an A^i such that \xn — Xm\ < 1 for all n,m > Ni. Hence for any n > Ni 
we have < \xn — xni \ + \xni\ < \xni \ + 1- It follows that if M = 1 + max{xn}nLi, 
then \xn\ < M for all n. Hence lim sup Xn exists. 

n— >oo 

Since the sequence is Cauchy, given e > there is an such that that \xn — Xm\ < 
e/2 for all n,m > N. Let 



L = limsupXn = lim sup{xm} 



oo 

m=n' 



Hence by Proposition fl0.1.4p there is an M > such that \xm — L\ < e/2. It follows 
that for any n > M we have \xn — L\ < \xn — xm\ + \xm — L\ < e/2 + e/2 = e. So 
lim Xn = L. □ 

n—*oo 

Definition 0.2.6 (Convergent and Absolutely Convergent). An infinite series Xr, 
of real numbers is said to converge provided the sequence {Sm}m=i converges where 
Sm = Yl^=i^n- It is said to converge absolutely provided the series J2'^=i \^n\ con- 
verges. 

Theorem 0.2.7 (Absolutely Convergent Series). If the series Yl'^=i^n converges 
absolutely, then it converges. 

Proof. Let Sm = J21Li the partial sum. We must show that lim Sm exists. We 

m— >oo 

will do this by showing it is a Cauchy sequence. Since the series \xi\ converges, 
given e > 0, there is an > such that YIIZn I-^*! ^ ^- Heiice if m > > 



oo 



\Sm. Sn\ — I ^ ^ Xi^ < ^ ^ \Xi\ < ^ ^ \Xi\ < e. 
i=n+l i=n+l i=N 

Hence {Sn} is a Cauchy sequence and converges. □ 
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0.3 Set Theory and Count ability 

Proposition 0.3.1 (Distributivity of fl and U). If for each j in some index set J 
there is a set Bj and A is an arbitrary set, then 

Ar\[jBj = [j{An Bj) andAupl Bj = U Bj). 

j&j j&J j&J jeJ 

The proof which is straightforward is left to the reader. 

Definition 0.3.2 (Set Difference, Complement). We define the set difference of sets 
A and B by 

A\B ^ {x \ X e A and x ^ B}. 

If all the sets under discussion are subsets of some fixed larger set E, then we can 
define the complement of A with respect to E to be — E \ A. 

We will normally just speak of the complement A"^ of A when it is clear what the 
larger set E is. Note the obvious facts that {A")" = A and that A\B ^ AnB". 

Proposition 0.3.3. If for each j in some index set J there is a set Bj C E, then 

Again the elementary proof is left to the reader. 

Proposition 0.3.4 (Well Ordering of N). Every non-empty subset AofN has a least 
element which we will denote min{A). 

Proof. Every finite subset of N clearly has a greatest element and a least element. 
Suppose ^4 C N is non-empty. Let 5 = {n e N | n < a for all a e ^4}. If 1 ^ then 
1 E A and it is the least element. Otherwise 1 G 5 so -B 7^ 0. Let b be the greatest 
element of the finite set B. The element — b+1 isin A and is its least element. □ 

Definition 0.3.5 (Injection, Surjection). Suppose A and B are sets and (p : A ^ B 
is a function. Then 

(1) The function (f) is called injective (or one-to-one^ if <P{x) = (t>{y) implies x = y. 

(2) The function (j) is called surjective (or ontoj if for every b E B there exists 
X E A such that 4>{x) = b. 
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(3) The function (p is called bijective if it is both injective and surjective. 

(4) IfC C B the set inverse (j)~^{C) is defined to be {a \ a e A and (f){a) e C}. IfC 
consists of a single element c we write (p~^{c) instead of the more cumbersome 

The notion of countability, which we now define, turns out to be a crucial ingre- 
dient in the concept of measure which is the main focus of this text. 

Definition 0.3.6 (Countable). A set A is called countable if it is finite or there is a 
bijection from A to the natural numbers N, (i.e. a one-to-one correspondence between 
elements of A and elements ofN). A set which is not countable is ca/Zec? uncountable. 

The following are standard properties of countable sets which we will need. 

Proposition 0.3.7 (Countable Sets). 

(1) If A is countable, then any non-empty subset of A is countable. 

(2) A set A is countable if and only if there is a surjective function f -.N ^ A. 

Proof. Item (1) is trivial if A is finite. Hence in proving it we may assume there is 
a bijection from A to N, and indeed, without loss of generality, we may assume A in 
fact equals N. 

To prove (1) suppose i? is a non-empty subset of A = N. If S is finite it is countable 
so assume it is infinite. Define : N ^ S by 0(1) = min{B), and 

0(A;)=mm(i?\{0(l),...,0(A;-l)}). 

The function is injective and defined for all k E N. Suppose m E B and let c be 
the number of elements in the finite set {n G -B | n < m}. Then 0(c) = m and hence 
is surjective. 

To prove (2) suppose f : N ^ A \s surjective. Define ip : A ^ 'H hy '^{x) — 
min{f~^{x)). This is a bijection from A to ip{A). Since ■0(^) is a subset of N it is 
countable by (1). This proves one direction of (2). The converse is nearly obvious. If 
A is countably infinite, then there is a bijection (and hence a surjection) / : N — > vl. 
But if A is finite one can easily define a surjection / : N ^ ^. □ 

Proposition 0.3.8 (Products and Unions of Countable Sets). If A and B are count- 
able, then their Cartesian product Ax B = {{a,h) | a e A, 6 e B} is a countable set. 
If An is countable for each n e N then U^^i is countable. 
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Proof. We first observe that from part (1) of the previous proposition a set A is 
countable if there is an injective function : A — > N. The function : N x N ^ N 
given by 0(m, n) = 2™3" is easily seen to be injective. This is because 2™'3" = 2''3^ 
only if 2"^~^ = 3*~". This is only possible iim — r — s — n — 0. Hence N x N is 
countable. To show A x B is countable when A and B are, we note that there are 
surjective functions f -.N ^ A and g : N ^ B so 

f X g -.n xN ^ Ax B 

is surjective. Since N x N is countable it follows from part (2) of the previous propo- 
sition that ^4 X i? is countable. 

To prove that a countable union of countable sets is countable note that if An is 
countable there is a surjection t/jn : N — > ^4^. The function 

oo 

* : N X N ^ IJ A„ 

n=l 

given by 

is a surjection. Since N x N is countable it follows that U^i is countable by part 
(2) of the previous proposition. □ 

Corollary 0.3.9 (Q is countable). The rational numbers Q are countable. 

Proof. The set Z is countable (see exercises below) so Z x N is countable and the 
function : Z x N — > Q given by m) — n/m is surjective so the set of rationals 
Q is countable. □ 

For an arbitrary set A we will denote by V{A) its power set, which is the set of 
all subsets of A. 

Proposition 0.3.10. Suppose A is a non-empty set and f : A ^(^)- Then f is 
not surjective. 

Proof. This proof is short and elegant, but slightly tricky. For a E A either a E f{a) 
or a ^ /(a). Let B ^ {a e A \ a ^ /(«)}• 

Let X be any element of A. H x & B, then, by the definition of S, we know 
X ^ f{x) and x E B so f{x) ^ B. On the other hand if x ^ S, then by the definition 
of B we know x G f{x) and since x ^ B we again conclude f{x) ^ B. Thus in either 
case f{x) ^ B, i.e. there is no x with f{x) = B, so f is not surjective. □ 
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As an immediate consequence we have the existence of an uncountable set. 
Corollary 0.3.11. The set V(N) is uncountable. 

Proof. This is an immediate consequence of Proposition (I0.3.10p and part (2) of 
Proposition flOXT]) . □ 

Corollary 0.3.12. If f : A B is surjective and B is uncountable, then A is 
uncountable. 

Proof. This is an immediate consequence of part (2) of Proposition (10 .3. 71) . since if A 
were countable the set B would also have to be countable. □ 

Later we will give an easy proof using measure theory that the set of irrationals is 
not countable (see Corollary (1A.2.7P ). But an elementary proof of this fact is outlined 
in the exercises below. 

The next axiom asserts that there is a way to pick an element from each non-empty 
subset of A. 

The Axiom of Choice. For any non-empty set A there is a choice function 

: ViA) \ {0} ^ A, 
i.e. a function such that for every non-empty subset B G A we have (f){B) G B. 
Exercise 0.3.13. 

1. Prove Propositions (10.3. ip and (I0.3.3p . 

2. {Inverse Function) 

If f : A B, then g : B A is called the inverse function of / provided 
g{f{a)) = a for all a G A and f{g{b)) = b for all b e B. 

(a) Prove that if the inverse function exists it is unique (and hence it can 
be referred to as the inverse). 

(b) Prove that / has an inverse if and only if / is a bijection. 

(c) If it exists we denote the inverse function of / by f~^. This is a 
slight abuse of notation since we denote the set inverse (see part (4) of Def- 
inition (10.3.51) ) the same way. To justify this abuse somewhat prove that if / 
has an inverse g, then for each b E B the set inverse f^^{{b}) is the set consist- 
ing of the single element g{b). Conversely show that if for every b E B the set 
inverse f~^{{b}) contains a single element, then / has an inverse g defined by 
letting g{b) be that single element. 
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3. Prove that any subset of Z is countable by finding an explicit bijection / : Z ^ 
N. 

4. {Uncountabilitity o/M) 

Let V be the set of all infinite sequences did2d^ . . .dn. . . where each dn is either 
or 1. 

(a) Prove that T> is uncountable. Hint: Consider the function / : V{N) — >■ 
T> defined as follows. If C N, then f{A) = did2d^ . . .dn - ■ ■ where (i„ = 1 if 
n e A and otherwise. 

(b) Define h :V ^ [0,1] by letting h{did2ds ... ... ) be the real number 

whose decimal expansion is 0.did2d3 . . .dn ■ ■ ■ Prove that h is injective. 

(c) Prove that the closed interval [0, 1] is uncountable. Hint: Show there 
is a surjective function : [0, 1] — > "D defined by = h~^{x) if x e hlV) and 

= otherwise. 

(d) Prove that if a < b, the closed interval {x \ a < x < b}, the open interval 
{x I a < X < b}, the ray {x \ a < x < oo}, and M are all uncountable. 



0.4 Open and Closed Sets 

We will denote the closed interval {x \ a < x < b} by [a, b] and the open interval 
{,T I a < X < 6} by (a, b). We will also have occasion to refer to the half open intervals 
{a,b] = {x \ a < X < b} and [a,b) = {x \ a < x < b}. Note that the interval [a, a] is 
the set consisting of the single point a and (a, a) is the empty set. 

Definition 0.4.1 (Open, Closed, Dense). A subset A cM. is called open if for every 
X E A there is an open interval (a, 6) C A such that x E {a,b). A subset B <Z is 
called closed if 'R\ B is open. A set A G M is said to be dense in M if every open 
subset contains a point of A. 

Proposition 0.4.2 (Q is dense in M). The rational numbers Q are a dense subset of 
R. 

Proof. Let U be an open subset of M. By the definition of open set there is a non- 
empty interval (a, b) C U. Choose an integer n such that ^ <b — a. Then every point 
of R is in one of the intervals In particular, for some integer io, < o < ^■ 

Since - < b — a it follows that 

n 

Zn — 1 in 1 , 

<a<-<a + -<b. 

n n n 
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Hence the rational number io/n is in (a, b) and hence in U. □ 

Theorem 0.4.3. An open set U dM. is a countable union of pairwise disjoint open 
intervals [jn=ii^n,bn) ■ 

Proof. Let x E U. Define = inf{?/ | [y,x] C U} and b^ = sup{y \ [x,y] C U} 
and let Ux = {ax,bx)- Then Ux C U but ax ^ U since otherwise for some e > 
0, [gx — e,ax + e] (Z U and hence [ax — e, x] C [ax — e,ax + e] U [ax + e,x] (Z U and 
this would contradict the definition of ax- Similarly bx ^ U. It follows that if z E Ux, 
then a^ = ax and bz = bx- Hence if [/^ fl f/^ ^ 0, then U^ = Ux or equivalently, if 
Uz 7^ Ux, then they are disjoint. 

Thus f/ is a union of open intervals, namely the set of all the open intervals Ux for 
X E U- Any two such intervals are either equal or disjoint, so the collection of distinct 
intervals is pairwise disjoint. 

To see that this is a countable collection observe that the rationals Q are countable 
so t/ n Q is countable and the function which assigns to each r E UCiQ the interval 
Ur is a surjective map onto this collection. By Proposition flO.3.71) this collection must 
be countable. □ 

Exercise 0.4.4. 

1. Prove that the complement of a closed subset of M is open. 

2. Prove that an arbitrary union of open sets is open and an arbitrary intersection 
of closed sets is closed. 

3. A point X is called a limit point of a set S if every open interval containing x 
contains points of S other than x- Prove that a set S" C M is closed if and only 
if it contains all its limit points. 

0.5 Compact Subsets of M 

One of the most important concepts for analysis is the notion of compactness. 

Definition 0.5.1. A closed set X G M. is called compact provided every open cover 
of X has a finite subcover. 

Less tersely, X is compact if for every collection V of open sets with the property 
that 

xc\Ju 

(76V 



0.5. COMPACT SUBSETS OF R 



11 



there is a finite collection Ui,U2, ■ ■ - Un of open sets in V such that 

n 

Xc(JUk. 

k=l 

For our purposes the key property is that closed and bounded subsets of R are 
compact. 

Theorem 0.5.2 (The Heine-Borel Theorem). A subset XofM. compact if and only 
if it is closed and bounded. 

Proof. To see that a compact set is bounded observe that f/„ = (— n, n) defines an 
open cover of any subset X of M. If this cover has a finite subcover for a set X, then 
X C Um for some m and hence X is bounded. To show a compact set X is closed 
observe that ii y ^ X, then = oo,y — ^) U (y+^,oo) defines an open cover 
of R \ {y} and hence of X. Since this cover of X has a finite subcover there is m > 
such that X C Um- It follows that {y — l/m,y + 1/m) is in the complement of X. 
Since y was an arbitrary point of the complement of X, this complement is open and 
X is closed. 

To show the converse we first consider the special case that X — [a, b] is a closed 
interval. Let V be an open cover of X and define 

z = supja; e [a, b] \ The cover V of [a, x] has a finite subcover}. 

Our aim is to prove that z = b which we do by showing that the assumption that 
z < b leads to a contradiction. There is an open set Uo E V with z G Uo- From the 
definition of open sets we know there arc points Zq,Zi G Uq satisfying zq < z < zi. 
Prom the definition of z the cover V of [a, Zq] has a finite subcover Ui,U2, ■ ■ - Un. 
Then the finite subcover C/q, Ui, U2, ■ ■ ■ Un of V is a cover of [a, ^i]. Since z < Zi this 
is contradiction arising from the assumption z < b. 

For an arbitrary closed bounded set X wc choose a, 6 G R such that X C [a, b]. If 
V is any open cover of X and we define Uq = M.\X, then V U {Uq} is an open cover 
of [a, b] which must have a finite subcover, say Uq, Ui, U2, . ■ . Un- Then Ui, U2, . . .Un 
must be a cover oi X. □ 

There is a very important property of nested families of bounded closed sets which 
we will use. 

Theorem 0.5.3 (Nested Families of Compact Sets). // {A„}5^]^ is a nested family 
of closed bounded subsets o/R, i.e. An C An-i, then D'^^iAn is non-empty. 
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Proof. Let x„ = ini An- Then {xn} is a bounded monotonic sequence so the hmit 
z = hm Xn exists by Theorem fl0.2.2p Since An is closed Xn € An and hence x„ G Am 
for all m < n. It follows that for any m > we have z G Am, i.e. z G flJ^j^An. □ 

Exercise 0.5.4. 

1. Prove that the set V = {m/2" | m G Z, n G M} is dense in M. 

2. Give an example of a nested family of non-empty open intervals t/i D f/2 ■ ■ ■ D 
Un... such that nf4 = 0. 

0.6 Continuous and Different iable Functions 

Definition 0.6.1 (Continuous and Uniformly Continuous Functions). A function 
/ : M ^ M zs continuous if for every x and every e > there is a 6{x) (depending 
on x) such that \f{y) — f{x)\ < e whenever \y — x\ < 6{x). A function / : M ^ M 
is uniformly continuous if for every e > there is a 6 ( independent of x and y ) such 
that \ f{y) — f{x)\ < e whenever \y — x\ < 6. 

Theorem 0.6.2. // / is defined and continuous on a closed interval [a, b] then it is 
uniformly continuous on that interval. 

Proof. Suppose e > is given. For any x G [a, b] and any positive number S let 
U{x, 6) = {x — 6,x + 6) From the definition of continuity it follows that for each x 
there is a S{x) > such that for every y G U{x,6{x)) we have \ f{x) — f{y)\ < e/2. 
Therefore if yi and ?/2 are both in U{x, S{x)) we note 

\f{yi) - f{y2)\ < \f{yi) - f{x)\ + |/(a:) - f{y,)\ <l + l = e. 

The collection {U{x, 6{x)/2) | x G [a, b]} is an open cover of the compact set [a, b] 
so it has a finite subcover {U{xi,6{xi)/2) | 1 < i < n}. Let 

5 = - min{5(xj) \ 1 < i < n}. 

Suppose now yi, y2 G [a, b] and \yi — 1/2I < Then yi is in U {xj, 6{xj)/2) for some 
1 < j < ''^ and 

6(x) 

\y2-Xj\ < \y2 -yi\ + \yi - Xj\ < 5 H ^ < S{xj). 

So both yi and y2 are in U{xj,6{xj)) and hence \ f{yi) — f{y2)\ < □ 
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We will also make use of the following result from elementary calculus. 

Theorem 0.6.3 (Mean Value Theorem). /// is is differentiable on the interval [a, b] 
then there is c & {a, b) such that 

fic) = M^M. 

b — a 

Corollary 0.6.4. // / and g are differentiable functions on [a, 6] and f'{x) = g'{x) 
for all X, then there is a constant C such that f{x) = g{x) + C. 

Proof. Let h{x) = f{x) — g{x), then h'{x) = for all x and we wish to show h is 
constant. But if ao,6o ^ [0',b], then the Mean Value Theorem says h{bo) — /i(ao) = 
/t'(c)(6o — oo) = since h'{c) = 0. Thus for arbitrary ao,6o ^ we have h{bo) = 
h{ao) so h is constant. □ 

Exercise 0.6.5. 

1. {Characterization of continuity) 
Suppose / is a function / : M M. 

(a) Prove that / is continuous if and only if the set inverse f~^{U) is open 
for every open set U gM.. 

(b) Prove that / is continuous if and only if the set inverse f~^{{a,b)) is 
open for every open interval (a, b). 

(c) Prove that / is continuous if and only if the set inverse f~^{C) is closed 
for every closed set C C M. 



0.7 Real Vector Spaces 

Definition 0.7.1 (Inner Product Space). A real vector space V is called an inner 
product space if there is a function ( , ) : V x V ^ M which for any vi,V2,w e V 
and any a, ci, C2 € M satisfies: 

1. Commutativity; {vi,V2) — {v2,v). 

2. Bi- linearity; (ci^i + C2V2, w) = ci{vi, w) + 02(^2, w). 

3. Positive Definiteness; {w,w) > with equality only if w = Q. 
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Definition 0.7.2 (Norm). If V is a real vector space with inner product { , ), we 
define the associated norm || || by \\v\\ = ^ {v, v). 

Proposition 0.7.3 (Cauchy-Schwarz Inequality). If{V,{ , )) is an inner product 
space and v,w e V, then 

\{v,w)\ < \\v\\ \\w\\, 
with equality if and only if v and w are multiples of a single vector. 

Proof. First assume \\v\\ — \\w\\ — 1. Then 

\\{v, w)w\\^ + 11^ — {v, w)w\\^ = {v, + {v — {v, w)w, v — {v, w)w) 

= {v,w)'^\\w\\'^ + \\v\\'^ — 2{v,w)'^ + (t", 

since = ll'^^IP — 1- Hence 

{v,w)'^ = \\{v,w)w\\^ < \\{v,w)w\\'^ + ||f — {v,w)w\\'^ = 1 

with equahty only if \\v — {v,w)w\\ — or v = {v. w)w. This implies the inequality 
< 1 = ||f II 11^11, when v and w are unit vectors. The result is trivial if cither 
w or w is 0. Hence we may assume the vectors are non-zero multiples v = avo and 
w — bwo of unit vectors vq and wq. In this case we have \{v,w)\ = \{avo,bwo)\ = 
\ab\\{vo,Wo)\ < \ab\ — \\avo\\ \\bwo\\ — \\v\\ \\w\\, 

Observe that we have equality only ii v — {v,w)w, i.e. only if one of the vectors 
is a multiple of the other. □ 

Proposition 0.7.4 (Normed Linear Space). If V is an inner product space and \\ \\ 
is the norm defined by \\v\\ — ^/ {v, v), then 

(1) For all a ^M. and v e V, \\av\\ = \a\ \\v\\. 

(2) For all v e V, II^H > with equality only if v = 0. 

(3) Triangle Inequality; For allv,w e V, < \\v\\ + \\w\\. 

(4) Parallelogram Law; For all v, w e V, 

+ ll-u + wlp = 211^1^ + 2||w|p. 
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Proof. The first two of these properties follow immediately from the definition of 
inner product. To prove item (3), the triangle inequality, observe 

11^; + wW^ — {v + w,v + w) 

— {v, v) + 2{v, w) + {w, w) 
= ||^;|P + 2(t;,w) + \\w\\^ 

< ||i;||^ + 2|(^;,'u;)| + ll^ll^ 

< + 2\\v\\ \\w\\ + ll^ll^ by Cauchy-Schwarz, 

-m\+\H\r 

To prove item (4), the parallelogram law, note \\v — w|p = {v — w,v — w) = 
\\v\\^ — 2{v, w) + \\w\\^. Likewise ||^; + w\\^ — {v + w,v + w) — ||f |p + 2{v, w) + ll^lp. 
Hence the sum \\v — wW^ + \\v + u^lp equals 2\\v\\'^ + 2\\w\\'^. □ 
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Chapter 1 

The Regulated and Riemann 
Integrals 



1.1 Introduction 

We will consider several different approaches to defining the definite integral 

/ f{x) dx 

J a 

of a function f{x). These definitions will all assign the same value to the definite 
integral, but they differ in the size of the collection of functions for which they are 
defined. For example, we might try to evaluate the Riemann integral (the ordinary 
integral of beginning calculus) of the function 




0, if X is rational; 

1, otherwise. 



The Riemann integral f{x) dx is, as we will see, undefined. But the Lebesgue 
integral, which we will develop, has no difficulty with f[x) and indeed f\x) dx = 1. 

There are several properties which we want an integral to satisfy no matter how 
we define it. It is worth enumerating them at the beginning. We will need to check 
them for our different definitions. 
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1.2 Basic Properties of an Integral 

We will consider the value of the integral of functions in various collections. These 
collections all have a common domain which, for our purposes, is a closed interval. 
They also are closed under the operations of addition and scalar multiplication. We 
will call such a collection a vector space of functions. More precisely a non-empty set 
of real valued functions V defined on a fixed closed interval will be called a vector 
space of functions provided: 

1. If/,^e V, then/ + ^e V. 

2. If / e V and r e R, then rf e V. 

Notice that this implies that the constant function is in V. All of the vector spaces 
we consider will contain all the constant functions. 

Three simple examples of vector spaces of functions defined on some closed interval 
/ are the constant functions, the polynomial functions, and the continuous functions. 

An "integral" defined on a vector space of functions V is a way to assign a real 
number to each function in V and each subinterval of /. For the function / G V and 
the subinterval [a, b] we denote this value by f(x) dx and call it "the integral of / 
from a to 6." 

All the integrals we consider will satisfy five basic properties which we now enu- 
merate. 

I. Linearity: For any functions f,g&V, any a,b e /, and any real numbers ci, C2, 



II. Monotonicity: If functions f,g e V satisfy f{x) > g{x) for all x and a,b E I 

satisfy a <b, then 




c\f{x) -\- C2g{x) dx — ci / f{x) dx + C2 / g{x) dx. 






In particular if f{x) > for all x and a < b then f{x) dx >0. 



III. Additivity: For any function / G V, and any a,b,c E I, 
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In particular we allow a, b and c to occur in any order on the line and we note 
that two easy consequences of additivity are 

pa pb pa 

/ f{x) dx — and / f{x) dx — — f{x) dx. 

J a J a J b 

IV. Constant functions: The integral of a constant function f{x) — C should be 
given by 

rb 

C dx^C{b-a). 



f 

J a 



If C > and a < b this just says the integral of / is the area of the rectangle 
under its graph. 

V. Finite Sets Don't Matter: If / and g are functions in V with f{x) = g{x) for 
all X except possibly a finite set, then for all a,b & I 

rb rb 



po po 

1 f{x) dx^ g{x) dx. 

J a J a 



Properties III, IV and V are not valid for all mathematically interesting theories 
of integration. Nevertheless, they hold for all the integrals we will consider so we 
include them in our list of basic properties. It is important to note that these are 
assumptions, however, and there are many mathematically interesting theories where 
they do not hold. 

There is one additional property which we will need. It differs from the earlier 
ones in that we can prove that it holds whenever the properties above are satisfied. 

Proposition 1.2.1 (Absolute Value). Suppose the integral f{x) dx has been de- 
fined for all f in some vector space of functions V and for all subintervals [a, b] of I. 
And suppose this integral satisfies properties I and II above. Then for any function 
f eV for which |/| e V 



' f{x) dx < f\f{x)\ dx, 

J a 



for all a < b in I. 



Proof. This follows from monotonicity and linearity. Since f{x) < for all x 

we know f{x) dx < \f{x)\ dx. Likewise —f{x) < \f{x)\ so — f{x) dx — 
J^—f{x) dx < J^\f{x)\ dx. But \ JI^ f{x) dx\ is either equal to JI^ f{x) dx or to 
— j'l f{x) dx. In either case \f{x)\ dx is greater so | f{x) dx\ < \f{x) \ dx. □ 
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1.3 Step Functions and the Regulated Integral 



The easiest functions to integrate are step functions which we now define. 

Definition 1.3.1 (Step Function). A function / : [a, 6] — > M is called a step function 
provided there numbers xq = a < xi < X2 < ■ ■ ■ < Xn-i < Xn = b such that f{x) is 
constant on each of the open intervals {xi-i,Xi). 

We will say that the points xq = a < Xi < X2 < ■ ■ ■ < Xn^i < Xn = b define an 
interval partition for the step function /. Note that the definition says that on the 
open intervals Xj) of the partition / has a constant value, say q, but it says 

nothing about the values at the endpoints. The value of / at the points Xi^i and Xi 
may or may or may not be equal to q. Of course when we define the integral this 
won't matter because the endpoints form a finite set. 

Since the area under the graph of a positive step function is a finite union of 
rectangles, it is pretty obvious what the integral should be. The i^^ of these rectangles 
has width {xi — Xi-i) and height q so we should sum up the areas Ci{xi — Xi-i). Of 
course if some of the q are negative, then the corresponding Ci{xi — Xi^i) are also 
negative, but that is appropriate since the area between the graph and the x-axis is 
below the x-axis on the interval {xi_i,Xi). 

Definition 1.3.2 (Integral of a step function). Suppose f{x) is a step function with 
partition Xq = a < Xi < X2 < ■ ■ ■ < Xn-i < Xn = b and suppose f{x) = Ci for 
Xi-i < X < Xi- Then we define 



Exercise 1.3.3. 

1. Prove that the collection of all step functions on a closed interval [a,b] is a 
vector space which contains the constant functions. 

2. Prove that ii xq = a < xi < X2 < ■ ■ ■ < a^n-i < a;„ = 6 is a partition for a step 
function / with value q on (a;.j_i, Xj) and = a < yi < y2 < ■ ■ ■ < yn-i < Um = 
b is another partition for the same step function with value dj on {yj-i, yi), then 




n 



m 
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In other words the value of the integral of a step function depends only on the 
function, not on the choice of partition. Hint: the union of the sets of points 
defining the two partitions defines a third partition and the integral using this 
partition is equal to the integral using each of the partitions. 

3. Prove that the integral of step functions as given in Definition 11.3.21 satisfies 
properties I-V of §1.2[ 

We made the "obvious" definition for the integral of a step function, but in fact, 
we had absolutely no choice in the matter if we want the integral to satisfy properties 
I-V above. 

Theorem 1.3.4. The integral as given in Definition \1.3.2\ is the unique real valued 
function defined on step functions which satisfies properties I- V of §i.M 

Proof. Suppose that there is another "integral" defined on step functions and satis- 
fying I-V. We will denote this alternate integral as 



Suppose that / has partition xq = a < Xi < X2 < ■ ■ ■ < < Xn = h and 

satisfies /(x) = q for < x < Xi. 
Then from the additivity property 



But on the interval the function f{x) is equal to the constant function with 

value Ci except at the endpoints. Since functions which are equal except at a finite 
set of points have the same integral, the integral of / is the same as the integral of q 
on Combining this with the constant function property we get 




What we must show is that for every step function /(x) 





(1.3.1) 
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If we plug this value into equation fll.3.ip we obtain 

/•b " pb 

(h f{x) dx = ^Ci{xi - Xi^i) = / f{x) dx. 

Ja Ja 

□ 

Recall the definition of a uniformly converging sequence of functions. 

Definition 1.3.5 (Uniform Convergence). A sequence of functions {fm} is said to 
converge uniformly on [a,b] to a function f if for every e > there is an M (inde- 
pendent of x) such that for all x G [a, b] 

— fm{x)\ < e whenever m > M. 

Contrast this with the following. 

Definition 1.3.6 (Pointwise Convergence). A sequence of functions {fm} is said to 
converge pointwise on [a, b] to a function f if for each e > and each x G [a, b] there 
is an Mx (clepending on x) such that 

\f{x) — fm{.x)\ < e whenever m > M.j.. 

Definition 1.3.7 (Regulated Function). A function f : [a,b] ^ M. is ca//ec? regulated 
provided there is a sequence {fm} of step functions which converges uniformly to f. 

Exercise 1.3.8. 

1. Prove that the collection of all regulated functions on a closed interval / is a 
vector space which contains the constant functions. 

2. Give an example of a sequence of step functions which converge uniformly to 
f{x) = X on [0,1]. Give an example of a sequence of step functions which 
converge pointwise to on [0, 1], but which do not converge uniformly. 

Every regulated function can be uniformly approximated as closely as we wish by 
a step function. Since we know how to integrate step functions it is natural to take 
a sequence of better and better step function approximations to a regulated function 
f{x) and define the integral of / to be the limit of the integrals of the approximating 
step functions. For this to work we need to know that the limit exists and that it 
does not depend on the choice of approximating step functions. 
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Theorem 1.3.9. Suppose {fm} is a sequence of step functions on [a, 6] converging 
uniformly to a regulated function f. Then the sequence of numbers {/^ fm{x) dx} con- 
verges. Moreover if {gm} is another sequence of step functions which also converges 
uniformly to f , then 

b 



lim / fmix) dx = lim / gm{x) dx. 

J a J a 



Proof. Let z^, = fm{x) dx. We will show that the sequence {zm} is a Cauchy 
sequence and hence has a limit. To show this sequence is Cauchy we must show that 
for any e > there is an M such that \zp — Zq\ < e whenever p,q > M. 

If we are given e > 0, since {/m} is a sequence of step functions on [a, b] converging 
uniformly to /, there is an M such that for all x 

\f{x) - fm{x)\ < 2(p'- a) ^^^"^^^^^ m>M. 
Hence whenever, p,q > M 

\f,{x) - f,{x)\ < \f,{x) - f{x)\ + \f{x) - f,{x)\ < 2^^^ = (1.3.2) 

Therefore, whenever p,q > M 

\zp-Zq\= / fp{x) - fq{x) dx < / \fp{x) - fq{x)\ dx < / —^dx = e, 



where the first inequality comes from the absolute value property of Proposition II. 2. 11 
and the second follows from the monotonicity property and equation ( 11.3.2^ . This 
shows that the sequence {zm} is Cauchy and hence converges. 

Now suppose that {gm} is another sequence of step functions which also converges 
uniformly to /, then for any e > there is an M such that for all x 

\f{x) - fmix)\ < t and \ f{x) - gra{x)\ < t 

whenever m > M. It follows that 

\fmix) - grnix)\ < |/„^(x) + < 2e. 

Hence, using the absolute value and monotonicity properties, we see 



r-6 



fm{x) - gm.{x) dx 



< 



pb pb 

/ \fm{x) - gm{x)\ dx < / 2e dx = 2e{b - a), 

J a J a 
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for all m > M. Since e is arbitrarily small we may conclude that 



lim 

m— >oo 



b rb 



fm{x)dx- / gm{x) dx = lim / fm{x) - gm{x) dx 



m— >oo 



b 



0. 



This implies 



pb pb 

lim / = lim / gm{x) dx. 



□ 

This result enables us to define the regulated integral. 

Definition 1.3.10 (The Regulated Integral). /// is a regulated function on [a,b] we 
define the regulated integral by 

b pb 

f{x) dx = lim / fnix) dx 

n— >oo / 
J a 

where {/„} is any sequence of step functions converging uniformly to f. 

We next need to see that the regulated functions form a large class including all 
continuous functions. 

Theorem 1.3.11 (Continuous functions are regulated). Every continuous function 
f : [a,b] M is a regulated function. 

Proof. By Theorem l0.6.2l a continuous function f{x) defined on a closed interval [a, b] 
is uniformly continuous. That is, given e > there is a corresponding 6 > such that 
\f{x) — f{y)\ < e whenever |x — ?/| < S. Let e„, = 1/2"' and let 5„ be the corresponding 
6 guaranteed by uniform continuity. 

Fix a value of n and choose a partition xq = a < Xi < X2 < ■ ■ ■ < Xm = b 
with Xi — < 6n. For example, we could choose m so large that if we define 
Ax = {b — a)/m then Ax < 6n and, then we could define Xi to be a + iAx. Next we 
define a step function /„ by 

fn{x) = f{xi) for all X G [xi-i,Xi[. 

That is, on each half open interval we define /„ to be the constant function 

whose value is the value of / at the left endpoint of the interval. The value of /«(&) 
is defined to be f{b). 
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Clearly fn{x) is a step function with the given partition. We must estimate its 
distance from /. Let x be an arbitrary point of [a, b]. It must lie in one of the open 
intervals of the partition or be an endpoint of one of them; say x G Then 
since fn{x) = fn{xi-i) = f{xi^i) we may conclude 

|/(x)-/„(x)|<|/(x)-/(x,_i)|<e„ 

because of the uniform continuity of / and the fact that \x — Xi-i\ < 6n- 

Thus we have constructed a step function /„ with the property that for all x G [a, b] 

\f{x)-fn{x)\ < e„. 

So the sequence {/„} converges uniformly to / and / is a regulated function. □ 
Exercise 1.3.12. 

1. Give an example of a continuous function on the open interval (0, 1) which is 
not regulated, i.e. which cannot be uniformly approximated by step functions. 

2. Prove that the regulated integral, as given in fll.3.10p . satisfies properties I-V 
of CI 

3. Prove that / is a regulated function on / = [a, b] if and only if both the limits 

lim f{x) and lim f{x) 

exist for every c G (a, b). (See section VII. 6 of Dieudonnc \T\). 

1.4 The Fundamental Theorem of Calculus 

The most important theorem of elementary calculus asserts that if / is a continuous 
function on [a,b], then its integral f{x) dx can be evaluated by finding an anti- 
derivative. More precisely, if F{x) is an anti-derivative of /, then 




fix) dx = F{b)-F{a). 



We now can present a rigorous proof of this result. We will actually formulate the 
result slightly differently and show that the result above follows easily from that 
formulation. 
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Theorem 1.4.1. /// is a continuous function and we define 

F{x)= r fit) dt 



then F is a differentiable function and F'{x) = f{x). 
Proof. By definition 



F'(xo) — lim 



F{xo + h)- F{xo) 
h 



so we need to show that 



r F{xo + h)-F{xo) 

hm = f{xo). 



or equivalently 



hm 



F{xo + h)- F{xo) 



To do this we note that 

F{xo + h)- F{xo) 



h 



h 



- f{xo) 



sTmdt 



h 



r^'f{t)dt-f{x,)h 



h 



LT'im - f{xo)) dt\ 



\h\ 



Monotonicity teUs us that when h is positive 



/ {f{t)-f{x^))dt < / \f{t)-f{xo)\ 
Jxo Jxo 



dt 



and if h is negative 



;i.4.i) 



/ {fit) - f{xo)) dt < / \fit) - f{xo)\ dt^- \f{t) - f{xo)\ 

J xo J xo+h J xo 



dt. 



In either case we see 

"xo+h 



(fit) - f{xo)) dt 



xo 



< 



xo+h 



\f{t)-f{xo)\dt 



xo 
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Combining this with inequahty fll.4.ip above we obtain 



Fjxo + h)- Fjxo) 
h 



< 



i::^'\m-f{xo)\dt\ 

\h\ 



;i.4.2) 



But the continuity of / imphes that given xq and any e > there is a 5 > 
such that whenever \t — xo\ < 6 we have \f{t) — f{xo)\ < e. Thus if \h\ < 6 then 



|/(t) — f{xo)\ < e for all t between xq and xq + h. It follows that | J^^^ 
/(xo)| dt\ < e\h\ and hence that 



i::^'\f{t)-f{xo)\dt\ 

\h\ 



< e. 



Putting this together with equation fll.4.2p above we have that 

F{xo + h)~ F{xo) 



h 



< e 



whenever |/;.| < 5 which was exactly what we needed to show. □ 

Corollary 1.4.2. Fundamental Theorem of Calculus. /// is a continuous func- 
tion on [a,b] and F is any anti- derivative of f , then 



I f{x) dx = F{b) - F{a) 

J a 



Proof. Define the function G{x) = f{t) dt. By Theorem (11.4.ip the derivative of 
G{x) is f{x) which is also the derivative of F. Hence F and G differ by a constant, 
say F{x) = G{x) + C (see Corollary flCTOD ). 
Then 

F{h) - F{a) = (Gib) + C)- (Gia) + C) 
= G{b) - G{a) 

' pa 

f{x) dx — I f{x) dx 



f{x) dx. 



Exercise 1.4.3. 

1. Prove that if / : [a, 6] — > M is a regulated function and F : [a, b] 
to by F{x) = f{t) dt then F is continuous. 



□ 



is defined 
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1.5 The Riemann Integral 

We can obtain a larger class of functions for which a good integral can be defined by 
using a different method of comparing with step functions. 

Suppose that f{x) is a bounded function on the interval I = [a,b] and that it is 
an element of a vector space of functions which contains the step functions and for 
which there is an integral defined satisfying properties I-V of §1.21 If u{x) is a step 
function satisfying f{x) < u{x) for all x E I, then monotonicity implies that if we 
can define f{x) dx it must satisfy f{x) dx < u{x) dx. 

This is true for every step function u satisfying f{x) < u{x) for all x G /. Let 
U{f) denote the set of all step functions with this property. Then if we can define 

f{x) dx in a way that satisfies monotonicity it must also satisfy 

f{x) dx<m.{l^j^u{x) dx mgW(/)}. (1.5.1) 

The infimum exists because all the step functions in U{f) are bounded below by a 
lower bound for the function /. 

Similarly we define C{f) to be the set of all step functions v{x) such that v{x) < 
f{x) for all X G /. Again if we can define j^f{x) dx in such a way that it satisfies 
monotonicity it must also satisfy 

sup I v{x) dx V e C{f) } < fi^) dx. (1.5.2) 

The supremum exists because all the step functions in U{f) are bounded above by an 
upper bound for the function /. 

Putting inequalities ( 11.5.ip and ( 11.5.2^ together, we see if V is any vector space 
of bounded functions which contains the step functions and we manage to define the 
integral of functions in V in a way that satisfies monotonicity, then this integral must 
satisfy 

sup I ^ v{x) dx V e /:(/)| < ^ fix) dx < inf I ^ u{x) dx u E W(/)} (1.5.3) 



for every / G V. Even if we can't define an integral for /, however, we still have the 
inequalities of the ends. 
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Proposition 1.5.1. Let f be any bounded function on the interval I = [a.b]. LetU{f) 
denote the set of all step functions u{x) on I such that f[x) < u{x) for all x and let 
£(/) denote the set of all step functions v{x) such that v{x) < f{x) for all x. Then 

sup I J v{x) dx V E < inf I J u{x) dx u E 

Proof. If f G C{f) and u G ti{f), then v{x) < f{x) < u{x) for all x G / so mono- 
tonicity implies that J^v{x) dx < JI^u{x) dx. Hence if 



V 



I J v{x) dx V E and ^ ~ J ^^''^^ '^^ ^ ^ ^(/)| 



then every number in the set V is less than or equal to every number in the set U. 
Thus sup V < inf U as claimed □ 

It is not difficult to see that sometimes the two ends of this inequality are not 
equal (see Exercise fll.5.4p below), but if it should happen that 

sup I J v{x) dx V E = inf I J u{x) dx u E U{f) 

then we have only one choice for J^^ f{x) dx; it must be this common value. 

This motivates the definition of the next vector space of functions we can integrate. 
Henceforth we will use the more compact notation 

sup < / v{x) dx> instead of sup < / v{x) dx v E C{f) 

v&CU) ^ Ja J Wa 



and 



inf 

u&AU) 



I j u{x) dx^ instead of inf | j u{x) dx u E 



Definition 1.5.2. The Riemann Integral. Suppose f is a bounded function on 
the interval I = [a,b]. Let U{f) denote the set of all step functions u{x) on I such 
that f{x) < u{x) for all x and let C{f) denote the set of all step functions v{x) such 
that v{x) < f{x) for all x. The function f is said to be Riemann integrable provided 



sup 



v{x) dx 



inf 

«ew{/) 



u{x) dx ?. 



In this case its Riemann integral JI^ f{x) dx is defined to be this common value. 
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Theorem 1.5.3. A bounded function f : [a,b] —>■ R is Riemann integrable if and only 
if, for every e > there are step functions vq and uq such that vo{x) < f{x) < uo{x) 
for all X e [a, b] and 

rb pb 
uo{x) dx — vo{x) dx < e. 

J a 

Proof. Suppose the functions Vq G C{f) and Uq G U{f) have integrals within e of each 
other. Then 



/ Vnix) dx < sup < / v(x) dx> < inf < / u(x) dx 

J a veC(f) >- Ja J «e^^(/) ^ Ja 



< I uo{x) dx. 



:(/) 

This implies 

inf < / u(x) dx} — sup < / v(x) dx> < e. 

u&AU) ^Ja J veC(f) ^Ja ^ 

Since this is true for all e > we conclude that / is Riemann integrable. 

Conversely if / is Riemann integrable, then by Proposition 10.1.41 there exists a 
step function uq G such that 

/ Unix) dx — fix) dx = Uo(x) dx — inf < / u(x) dx> < e/2. 
Similarly there exists a step function vq G C{f) such that 

I pb 

f{x) dx — Vo{x) dx < e/2. 



Hence 

pb pb 

/ Uo{x) dx — voix) dx < e/2 + e/2 = e. 

J a J a 

and Mo and vq are the desired functions. □ 
Exercise 1.5.4. 

1. At the beginning of these notes we mentioned the function / : [0, 1] — > M which 
has the value f{x) = if x is rational and 1 otherwise. Prove that for this 
function 

sup < / v(x) dx> = and inf < / u(x) dx> = 1. 
Hence / is not Riemann integrable. 
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There are several facts about the relation with the regulated integral we must 
establish. Every regulated function is Riemann integrable, but there are Riemann 
integrable functions which have no regulated integral. Whenever a function has both 
types of integral the values agree. We start by giving an example of a function which 
is Riemann integrable, but not regulated. 



Example 1.5.5. Define the function / : [0, 1] — > M by 

fix)- 



1, if X = - for n G Z"* 
0, otherwise. 



Then f(x) is Riemann integrable and f{x) dx = but it is not regulated. 
Proof. We define a step function Um{x) by 



Um\X 



1, ifO<x<^] 
f{x), otherwise. 



A partition for this step function is given by 

1 1 1 

xo = < xi = — < a;2 = < ■ ■ ■ < Xm-i = - < x^ = 1. 

m m — 1 2 

Note that Umix) > /(x). Also Umix) dx = 1/m. This is because it is constant 
and equal to 1 on the interval [0, 1/m] and except for a finite number of points it is 
constant and equal to on the interval [1/m, 1]. Hence 

inf I / u(x) dx\ < inf I / Umix) dx\ = inf | — | = 0. 

ueu{f) I Jq J mez+ I Jq J m&+ m 

Also the constant function is < /(x) and its integral is 0, so 

< sup < / f (x) dx >. 
vec(f) ^ Jo ^ 

Putting together the last two inequalities with Proposition fll.S.ip we obtain 
< sup < / v(x) dx>< inf < / u(x) dx><0. 
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So all of these inequalities are equalities and by definition, / is Riemann integrable 
with integral 0. 

To see that / is not regulated suppose that g is an approximating step function 
with partition xq = < xi < ■ ■ ■ < Xm = 1 and satisfying |/(a;) — g{x)\ < e for some 
e > 0. Then g is constant, say with value Ci on the open interval (0,a;i). 

Now there are points ai,a2 G (0,a;i) with /(ci) = and /(a2) = 1- Then |ci| = 
|ci — 0| = \g{ai) — /(cii)l < e and |1 — ci| = |/(a2) — g{cL2)\ < e- But |ci| + |1 — ci| > 
|ci + 1 — ci| = 1 so at least one of |ci| and |1 — ci| must be > 1/2. This implies that 
e > 1/2. That is, / cannot be uniformly approximated by any step function to within 
e if e < 1/2. So / is not regulated. □ 

Theorem 1.5.6 (Regulated functions are Riemann integrable). Every regulated func- 
tion f is Riemann integrable and the regulated integral of f is equal to its Riemann 
integral. 

Proof. If / is a regulated function on the interval / = [a, b], then, for any e > 0, it can 
be uniformly approximated within e by a step function. In particular, if e„ = 1/2" 
there is a step function such that \f{x)—gn{x) \ < e„ for all x G /. The regulated 
integral f{x) dx was defined to be lim gn{x) dx. 

We define two other approximating sequences of step functions for /. Let Un{x) = 
gn{x) + 1/2" and f„(x) = gn{x) — 1/2". Then Un{x) > f{x) for all a; G / because 
Unix) - f{x) = 1/2" + gn{x) - f{x) > since \gnix) - f{x)\ < 1/2". Similarly 
Vn{x) < f{x) for all X G / because f{x) — f„(x) = 1/2" + f{x) — gn{x) > since 
|/(x)-(?„(x)| <l/2". 

Since Unix) - Vn{x) = g^ix) + 1/2" - (g^ix) - 1/2") = 1/2"-^, 

Unix) dx- J Vn{x) dx = J M„(x) - Vn{x) dx = J dx = 

Hence we may apply Theorem (11. 5. 3 p to conclude that / is Riemann integrable. 
Also 

b rb ^ ' 



lim / gnix) dx = lim / f„(x) H dx = lim / Vn{x) dx, and 

n^ooj^ n^ooj^ 2" A 

pb r-b r-b 

lim / gn{x) dx = lim / Un{x) dx = lim / Un{x) dx. 

n—ioo / n— >oo / 2" n—>oo 



Since for all n 

rb rb pb 



Vn{x) dx < f{x) dx < Un{x) dx 
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we conclude that 

pb pb 

lim / gn{x) dx — I f{x) dx. 

n— >oo /„ /„ 

That is, the regulated integral equals the Riemann integral. □ 

Theorem 1.5.7. The set TZ of Riemann integrable functions on an interval I — [a, b] 
is a vector space containing the vector space of regulated functions. 

Proof. We have already shown that every regulated function is Riemann integrable. 
Hence we need only show that whenever f,geTZ and r e IR we also have {f + g)eTl 
and rf e TZ. We will do only the sum and leave the product as an exercise. 

Suppose e > is given. Since / is Riemann integrable there are step functions Uf 
and Vf such that Vf{x) < f{x) < Uf{x) for a; e / (i.e. Uf e U{f) and Vf e and 
with the property that 



pb pb 

' Uf{x) dx — I Vf{x) dx 

a J a 



< e. 

Similarly there are Ug G U{g) and Vg G C{g) with the property that 

pb pb 

/ Ug{x) dx — Vg{x) dx < €. 

J a J a 

This implies that 

pb pb 

/ {uf + Ug){x) dx — / (vf + Vg){x) dx < 2e. 

J a J a 

Since {uf + Ug) & U{f + g) and {vf + Vg) G C{f + g) we may conclude that 
inf < / u{x) dx \ — sup < / v{x) dx \ < 2e. 

u&^U+g) ^ Ja J veC{f+g)^Ja ^ 

As e > is arbitrary we conclude that 

inf < / u(x) dx> = sup < / v(x) dx> 

ueU{f+g) J veC{f+g) ^Ja ^ 

and hence {f + g) E TZ. □ 
Exercise 1.5.8. 

1. Prove that if / and g are Riemann integrable functions on an interval [a,b], 
then so is fg. In particular if r G R, then rf is a Riemann integrable function 
on [a, b]. 
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Chapter 2 
Lebesgue Measure 



2.1 Introduction 



In the previous section we studied two definitions of integral that were based on two 
important facts: (1) There is only the one obvious way to define the integral of step 
assuming we want it to satisfy certain basic properties, and (2) these properties force 
the definition for the integral for more general functions which are uniformly ap- 
proximated by step functions (regulated integral) or squeezed between step functions 
whose integrals are arbitrarily close (Riemann integral). 

To move to a more general class of functions we first find a more general notion 
to replace step functions. For a step function / there is a partition of / = [0, 1] into 
intervals on each of which / is constant. We now would like to allow functions for 
which there is a finite partition of / into sets on each which / is constant, but with 
the sets not necessarily intervals. For example we will consider functions like 



The interval / is partitioned into two sets A = I (1 Q and B = I (1 Q*^, i.e. the 
rational points of / and the irrational points. Clearly the integral of this function 
should be 31en(74) + 21en(5), but only if we can make sense of len(74) and len{B). 
That is the problem to which this chapter is devoted. We want to generalize the 
concept of length to include as many subsets of IR as we can. We proceed in much 
the same way we did in previous chapters. We first decide what are the "obvioTis" 
properties this generalized length must satisfy to be of any use, and, then try to 




(2.1.1) 
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define it by approximating with simpler sets where the definition is clear, namely sets 
of intervals. 

The generalization of length we want is called Lebesgue measure. Ideally we would 
like it to work for any subset of the interval / = [0, 1], but it turns out that it is not 
possible to achieve that. 

There are several properties which we want any notion of "generalized length" 
to satisfy. For each bounded subset A of M we would like to be able to assign a 
non- negative real number that satisfies the following: 

I. Length, li A = (a, b) or [a, b] then iJ,{A) — len(A) — b — a, i.e. the measure of an 

open or closed interval is its length 

II. Translation Invariance. If A C M is a bounded subset of M and c e IR., then 

Ijl{A -\- c) — iJi{A), where A-\- c denotes the set {x + c | x e A}. 

III. Countable Additivity. If {^n}^i is a countable collection of bounded subsets 

of R, then 

oo oo 
n=l n=l 

and if the sets are pairwise disjoint, then 

oo oo 
n=l n=l 

Note the same conclusion applies to finite collections {>ln}™=i of bounded sets 
(just let ^4^ = for i > m). 

IV. Monotonicity U A C B, then < ij.{B). Actually, this property is a conse- 

quence of additivity since A and B \A are disjoint and their union is B. 

It should be fairly clear why we most of these properties arc absolutely necessary 
for any sensible notion of length. The only exception is property 111, which deserves 
some comment. We might ask that additivity only hold for finite collections of sets, 
but that is too weak. For example, if we had a collection of pairwise disjoint intervals 
of length 1/2, 1/4, 1/8, .. . 1/2", . . . , etc., then we would certainly like to be able say 
that the measure of their union is the sum ^1/2" = 1 which would not follow 
from finite additivity. Alternatively, one might wonder why additivity is only for 
countable collections of pairwise disjoint sets. But it is easy to see why it would lead 
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to problems if we allowed uncountable collections. If {x} is the set consisting of a 
single point x G [0,1], then fi{{x}) = by property I and [a.b] is an uncountable 
union of pairwise disjoint sets, namely each of the sets consisting of one point of [a, b]. 
Hence we would have //([a, b]) — b — a is an uncountable sum of zeroes. This is one 
reason the concept of uncountable sums isn't very useful. Indeed, we will see that the 
concept of countability is intimately related to the concept of measure. 

Unfortunately, as mentioned above, it turns out that it is impossible to find a fi 
which satisfies I-IV and which is defined for all bounded subsets of the reals. But we 
can do it for a very large collection which includes all the open sets and all the closed 
sets. The measure we are interested in using is called Lebesgue measure Its actual 
construction is slightly technical and wc have relegated that to an appendix. Instead 
we will focus the properties of Lebesgue measure and how to use it. 

2.2 Null Sets 

One of our axioms for the regulated integral was, "Finite sets don't matter." Now 
we want to generahze that to say that sets whose "generalized length," or measure, 
is zero don't matter. It is a somewhat surprising fact that even without defining 
Lebesgue measure in general we can easily define those sets whose measure must be 
and investigate the properties of these sets. 

Definition 2.2.1 (Null Set). A set X G M. is called a null set provided for every 
e > there is a collection of open intervals {Un}'^=i such that 

oo oo 

len(t/„) < e and X C [JUn- 

n=l n=l 

Perhaps surprisingly this definition makes no use of the measure fi. Indeed, we 
have not yet defined the measure of a set X for any choice of the set X! However, 
it is clear that if we can do so in a way that satisfies properties I-IV above and the 
Hence to denote X has measure zero, we will write IJ>{X) — even though we have 
not yet defined fi. 

If X is a null set in / = [0, 1] we will say that its complement X'^ has full measure 
in /. 

Exercise 2.2.2. 

1. Prove that a finite set is a null set. 
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2. Prove that a countable union of null sets is a null set (and hence, in particular, 
countable sets are null sets). 

3. Assuming that a measure /i has been defined and satisfies properties I-IV above, 
find the numerical value of the integral of the function f{x) defined in Equation 
(12. 1.11) . Prove that the Riemann integral of this function does not exist. 

It is not true that countable sets are the only sets which are null sets. We give an 
example in Exercise (12.6. ip below, namely, the Cantor middle third set, which is an 
uncountable null set. 

2.3 Sigma algebras 

As mentioned before there does not exist function /i satisfying properties I-IV and 
defined for every subset of / = [0,1]. In this section we want to consider what is 
the best collection of subsets of / for which we can define a "generalized length" or 
measure /i. Suppose we have somehow defined fi for all the sets in some collection 
A of subsets of / and it satisfies properties I-IV. Property I only makes sense if fi is 
defined for open and closed intervals, i.e. we need open and closed intervals to be in 
A. For property III to make sense we will need that any countable union of sets in 
A is also in A. Finally it seems reasonable that if A is a set in the collection A, then 
the set Z"^, its complement in /, should also be in A. 
All this motivates the following definition. 

Definition 2.3.1 (Sigma algebra). Suppose X is a set and A is a collection of subsets 
of X. A is called a cr-algebra of subsets of X provided it contains the set X and is 
closed under taking complements (with respect to X), countable unions, and countable 
intersections. 

In other words if ^ is a a-algebra of subsets of X, then any complement (with 
respect to X) of a set in A is also in A, any countable union of sets in A is in 
A, and any countable intersection of sets in A is in A. In fact the property about 
countable intersections follows from the other two and Proposition fl0.3.3p which says 
the intersection of a family of sets is the complement of the union of the complements 
of the sets. Also notice that if A, i? G A, then their set difference A \ B = {x & 
A \ X ^ B} ism A because A\B = Ar\ B". 

Since X is in any cr-algebra of subsets of X (by definition), so is its complement, 
the empty set. A trivial example of a a-algebra of subsets of X is ^ = {X, 0}, i.e. it 
consists of only the whole set X and the empty set. Another example is ^ = 'P(X), 
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the power set of X, i.e. the collection of all subsets of X. Several more interesting 
examples are given in the exercises below. Also in these exercises we ask you to show 
that any intersection of cr-algebras is a cr-algebra. Thus for any collection C of subsets 
of R there is a smallest cr-algebra of subsets of IR which contains all sets in C, namely 
the intersection of all cr-algebras containing C. 

Definition 2.3.2 (Borel Sets). If C is a collection of subsets ofM. and A is the the 

smallest a-algebra of subsets of M which contains all the sets of C then A is called 
the a-algebra generated by C. Let B be the a-algebra of subsets o/M generated by the 
collection of all open intervals. E is called the Borel cr-algebra and elements of B are 
called Borel sets. 

In other words B is the collection of subsets of M which can be formed from 
open intervals by any finite sequence of countable unions, countable intersections or 
complements. 

Exercise 2.3.3. 

1. Let A = {X G I \ X is countable, or X'^ is countable}. Prove that ^ is a cr- 
algebra. 

2. Let A = {X C / I X is a null set, or X'^ is a null set}. Prove that ^ is a cr- 
algebra. 

3. Suppose A\ is a cr-algebra of subsets of X for each A in some indexing set A. 
Prove that 

A=f]Ax 

is a cr-algebra of subsets of X. 

4. Let A he a. cr-algebra of subsets of M and suppose / is a closed interval which is 
in A. Let A{I) denote the collection of all subsets of / which are in A. Prove 
that A{I) is a cr-algebra of subsets of /. 

5. Suppose Ci is the collection of closed intervals in M, 
C2 is the collection of all open subsets of M, and 

C3 is the collection of all closed subsets of M. 

Let Ai be the cr-algebra generated by Prove that Bi,B2, and B3 are all equal 
to the Borel cr-algebra B. 
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2.4 Lebesgue Measure 

The (T-algebra of primary interest to us is the one generated by Borel sets and null 
sets. Alternatively, as a consequence of part 5. of Exercise ( I2.3.3p . it is the a-algebra 
of subsets of M generated by open intervals, and null sets, or the one generated by 
closed intervals and null sets. 

Definition 2.4.1. The a-algebra of subsets o/M generated by open intervals and null 
sets will be denoted Ai. Sets in Ai will be called Lebesgue measurable, or measurable 
for short. If I is a closed interval, then A4{I) will denote the Lebesgue measurable 
subsets of L 

For simplicity we will focus on subsets of / = [0, 1] though we could use any other 
interval just as well. Notice that it is a consequence of part 4. of Exercise ( 12.3.3P 
that is a cr-algebra of subsets of /. It is by no means obvious that A4 is not 

the (T-algebra of all subsets of M. However, in section ([B]) of the appendix we will 
construct a subset of I which is not in Ai. 

We are now ready to state the main theorem of this Chapter. 

Theorem 2.4.2 (Existence of Lebesgue Measure). There exists a unique function fi, 
called Lebesgue measure, from Ai{I) to the non-negative real numbers satisfying: 

I. Length. If A = {a,b) then fi{A) = \en{A) = b — a, i.e. the measure of an open 

interval is its length 

II. Translation Invariance. Suppose A G I, c G M and A + c (Z I where A + c 

denotes the set {x + c \ x E A}. Then ii{A + c) = 

III. Countable Additivity. // {v4„}^]^ is a countable collection of subsets of I, 

then 

oo oo 

fl{\jA^)<J2KAn) 

n=l n=l 

and if the sets are pairwise disjoint, then 

oo oo 
n=l n=l 

IV. Monotonicity If A, B G M{I) and A C B then fi{A) < 12(B) 

V. Null Sets A subset A G I is a null set set if and only if A G Ai{I) and fJ^{A) = 0. 
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Note that the countable additivity of property III imphes the analogous statements 
about finite additivity. Given a finite collection {An}^^^^ of sets just let A, = for 
i > m and the analogous conclusions follow. 

We have relegated the proof of most of this theorem to Appendix A, because it 
is somewhat technical and is a diversion from our main task of developing a theory 
of integration. However there are some properties of Lebesgue measure we can easily 
derive; so we do so now. For example, we will use properties I-III of Theorem (12.4.21) 
to prove property IV. 

Proposition 2.4.3 (Monotonicity). If A,B e M{I) and A C B, then < 

Proof. Since A G B we have B = AU {B \ A). Also A and B \ A are disjoint so 
by property III we know fi{A) + fx{B \ A) = fi{B). But fx{B \ A) > so ii{A) < 
fx{A) + fx{B\A) = fi{B). □ 

Proposition 2.4.4. If X C I is a null set, then X E M{I) and fi{X) = 0. 

Proof. If X C / is a null set, then by the definition of M.{I) we know X E Ai{I). If 
e > there is a collection of open intervals {Un}'^=i such that 

oo oo 

^len(f/„) < e and X C [J Un- 

71=1 n=l 

Property I says len(f/„) = fi{Un), so 

oo oo 

J]/i(f^n) = 5^1en(f/„)<e. 

Ji=l n=l 

Since X C UUn properties II and III imply 

oo oo 
^i{X)<^i{\JUn)<Y,l^iUn)<e, 

n=l 71=1 

This is true for any e > so the only possible value for fi{X) is zero. □ 

Recall that set difference A\B = {xeA\x^ B}. Since we are focusing on 
subsets of / complements are with respect to I so A'^ = I \ A. 

Proposition 2.4.5. 

(1) The Lebesgue measure of I, yu(/), is 1 and hence fi{A^) = 1 — yu(A). 
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(2) If A andB areinM{I), then A\B is in M andii{AUB) ^ ii{A\ B) + fj,{B) . 

Proof. To see (1) observe that A and A'^ are disjoint and A\J A'^ — I, so additivity 
implies fi{A) + /x(A^) = fi{A U A") = fi{I) = 1. 

For (2) note that A \ B = A Ci B'^ which is in Also A\B and B are disjoint 
and their union is A\JB. So once again additivity implies implies ^{A \B) + f^{B) = 
fx{ALiB). □ 

If we have a countable increasing family of measurable sets then the measure of 
the union can be expressed as a limit. 

Proposition 2.4.6. If Ai (Z A2 (Z • ■ ■ (Z An . . . is an increasing sequence of measur- 
able subsets of I, then 

00 

J An) = lim IJ,{An). 

n=l 

// i?i D ^2 D • • • D S„ . . . is a decreasing sequence of measurable subsets of I, then 

00 

fi{r]Bn)= hm 

■ ' n— >oo 

n=l 

Proof. Let Fi — Ai and F„ = An\An-i for n > 1. Then {-Fn}^i are pairwise disjoint 
measurable sets, A^ — Uf^^Fj and U^^Aj = U^^Fj. Hence by countable additivity 
we have 

00 00 00 

i=l i=l i=l 



= lim = lim l^{\jFi) 

n— >oo ' n— >oo 

i=l i=l 

= lim n{An). 

n— »oo 

For the decreasing sequence we define En — B^. Then {-Enj^i is an increasing 
sequence of measurable functions and 

00 00 

n=l n=l 

Hence 

00 00 
//( n S„) = 1 - /i(U Fi) = 1 - lim i^{En) = lim (1 - //(K)) = lim 

■ ' n— »oo n— +00 n— ♦00 
n=l 1=1 

□ 
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2.5 The Lebesgue Density Theorem 

The following theorem asserts that if a subset of an interval I is "equally distributed" 
throughout the interval then it must be a null set or the complement of a null set. 
For example it is not possible to have a set A C [0, 1] which contains half of each 
subinterval, i.e. it is impossible to have fi{An[a, b]) = fi{[a, b])/2 for all < a < 6 < 1. 
There will always be small intervals with a "high concentration" of points of A and 
other subintervals with a low concentration. Put another way, it asserts that given 
any p < 1 there is an interval U such that a point in U has probability > p of being 
in A. 

Theorem 2.5.1. If A is a Lebesgue measurable set and > and ifO<p<l, 
then there is an open interval U = {a, b) such that fi{A fl f/) > pn{U) = p{b — a). 

Proof. Let p G (0, 1) be given. We know from the definition of outer measure and the 
fact that = fJ'iA), that for any e > there is a countable open cover {Un}'^=i 

of A such that 

oo 

< J]len(?7„) </i(A) + e. 

n=l 

Choosing e = (1 — p)fi{A) we get 

oo 

^len(f/„)</i(A) + (l-j9)M^) 

n=l 

oo 

</i(A) + (l-p)^len([/„) 

n=l 

SO 

oo oo 

pJ2^en{Un) < M^) < J]/^(An f/„). (2.5.1) 

71=1 n=l 

where the last inequality follows from subadditivity. Since these infinite series have 
finite sums, there is at least one uq such that Pl^'iPn^) < fl Un^)- This is because 
if it were the case that pfj,{Un) > fi{A fl Un) for all n, then it would follow that 
p^^^len(f/n) > ^^i/i(^n?7„) contradicting equation (12.5.11) . The interval [/„o is 
the U we want. □ 

There is a much stronger result than the theorem above which we now state, but 
do not prove. A proof can be found in Section 9.2 of 0. 



44 



CHAPTER 2. LEBESGUE MEASURE 



Definition 2.5.2. If A is a Lebesgue measurable set and x & A, then x is called a 
Lebesgue density point if 

fxiA n [x - e, X + e\) 
lim 77 — = 1. 

e^o jj,[[x — e, X + e\) 

Theorem 2.5.3 (Lebesgue Density Tlieorem). If A is a Lebesgue measurable set, 
then there is a subset E G A with n{E) = such that every point of A \ E is a 
Lebesgue density point. 

2.6 Lebesgue Measurable Sets - Summary 

In this section we provide a summary outline of the key properties of collection }A 
of Lebesgue measurable sets which have been developed in this chapter. Recall / is 
a closed interval and M.{I) denotes the subsets of / which are in /. 

1. The collection of Lebesgue measurable sets is a a -algebra, which means 

mllAeM, then A" e M. 

• If A„ e Al for n e N, then Ur=i e M. 

• If ^„ e At for n e N, then 0^=1 e M. 

2. All open sets and all closed sets are in M.. Any null set is in Al. 

3. If A G M.{I), then there is a real number ii{A) called its Lebesgue measure 
which satisfies: 

• The Lebesgue measure of an interval is its length. 

• Lebesgue measure is translation invariant. 

• li AeM, then = 1 - 

• \i A E M. is a, null set if and only if = 0. 

• Countable Subadditivity: If ^4^ e for n e N, then 

oo oo 
n=l n=l 

• Countable Additivity: If A„ e At for n e N are pairwise disjoint, then 

oo oo 

li{[jA^)^^n{A^). 

n=l n=l 



2.6. LEBESGUE MEASURABLE SETS - SUMMARY 45 

• Increasing sequences: If ^„ e for n e N satisfy An C then 

oo 

pL{\\An)^ lim pL{An). 
n=l 

• Decreasing sequences: If A„ e for n e N satisfy A„ D then 

oo 

' ' n^oo 
n=l 
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Exercise 2.6.1. 

1. Prove for a,b E I that fi{[a, b]) = fi{[a, b[) = b — a. 

2. Let X be the subset of irrational numbers in /. Prove /u(X) = 1. Prove that if 
F C / is a closed set and fi(Y) = 1, then Y = I. 

3. {The Cantor middle third set) We first recursively define a nested sequence 
{•Jn}'^=o of closed subsets of /. Each J„ consists of a finite union of closed 
intervals. We define Jq to be J = [0, 1] and let J„ be the union of the closed 
intervals obtained by deleting the open middle third interval from each of the 
intervals in J„_i. Thus Jq = [0, 1], Ji = [0, 1/3] U [2/3, 1] and Js = [0, 1/9] U 
[2/9, 1/3] U [2/3, 7/9] U [8/9, 1] etc. 

Let C = Pi^qJ^. It is called the Cantor Middle Third set. 

(a) When the open middle thirds of the intervals in J„_i are removed we 
are left with two sets of closed intervals: the left thirds of the intervals in Jn-i 
and the right thirds of these intervals. We denote the union of the left thirds 
by Ln and the right thirds by Rn, and we note note J„ = L„ U Prove that 
Ln and Rn each consist of 2"~^ intervals of length 1/3" and hence J„ contains 
2" intervals of length 1/3". 

(b) Let V be the uncountable set set of all infinite sequences did2d^ . . .dn ■ ■ ■ 
where each dn is either or 1 (see part 4. of Exercise (10.3. 13p ) and define a 
function ip : C ^ V hj ip{x) = ^1^2(^3 . . .dn - ■ ■ where each dn = if x E Ln 
and dn = I if X E Rn- Prove that ip is surjective and hence by Corollary (10.3.121) 
the set C is uncountable. Hint: You will need to use Theorem (10.5.31) . 

(c) Prove that C is Lebesgue measurable and that /i(C) = 0. Hint: Con- 
sider C^, the complement of C in /. Show it is measurable and calculate /i(C"^). 
Alternative hint: Show directly that C is a null set by finding for each e > a 
collection of open intervals {Un}'^=i such that 



00 00 
^len(f/„) < e and C C U Un- 

n=l n=l 



Chapter 3 

The Lebesgue Integral 



3.1 Measurable Functions 

In this chapter wc want to define the Lebesgue integral in a fashion which is analogous 
to our definitions of regulated integral and Riemann integral from Chapter 1. The 
difference is that we will no longer use step functions to approximate a function 
we want to integrate, but instead will use a much more general class called simple 
functions. 

Definition 3.1.1 (Characteristic Function). If A <Z [0,1], its characteristic function 
3CAix) is defined by 




1, ifxeA; 
0, otherwise. 



Definition 3.1.2 (Measurable partition). A finite measurable partition of [0, 1] is a 
collection {^1}"=! of measurable subsets which are pairwise disjoint and whose union 
is [0, 1]. 

We can now define simple functions. Like step functions these functions have only 
finitely many values, but unlike step functions the set on which a simple function 
assumes a given value is no longer an interval. Instead a simple function is constant 
on each subset of a finite measurable partition of [0, 1]. 

Definition 3.1.3 (Simple Function). A function f : [0, 1] — > IR is called Lebesgue 

simple or simple, for short, provided there exist a finite measurable partition {Ai}"^^^ 
and real numbers ri such that f{x) = ^1^=1 ^i^Ai- The Lebesgue integral of a simple 
function is defined by J f d/j, = Yl^=i''^il^i^i)- 
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The definition of integral of a simple function should come as no surprise. The 
fact that J Xa{x) dfi is defined to be IJ.{A) is the generalization of the fact that the 
Riemann integral j'' 1 dx = {b — a). The value of / / ri/i for a step function / is, then 
forced if we want our integral to have the linearity property. 

Lemma 3.1.4 (Properties of simple functions). The set of simple functions is a vector 
space and the Lebesgue integral of simple functions satisfies the following properties: 

1. Linearity; If f and g is simple functions and Ci, C2 G then 

cif + C2g di^ = ci / f djji + C2 / g dji. 



2. Monotonicity; /// and g are simple and f{x) < g{x) for all x, then J f d/i < 

3. Absolute value; /// is simple then \f \ is simple and \ J f d/i] < J \f\ d/i. 

Proof. If / is simple, then clearly cif is simple. Hence to show that simple functions 
form a vector space it suffices to show that the sum of two simple functions are simple. 
Suppose {Aj}"^]^ and {Bj}^^^ are measurable partitions of [0, 1] and that f{x) = 
and g{x) = Y^JLi Sj^Bj are simple functions. We consider the measurable 
partition {Cjj} with = A^n Bj. Then Ai = Uj=i ^ij and Bj = UILi 

n n m 

1=1 i=l j=l i,j 

Likewise 

m m n 

j=l j=l i=l ij 

Hence f{x) + g{x) = + ^j)^Cij{x) is simple and the set of simple functions 

forms a vector space. 

It follows immediately from the definition that if / is simple and a e IR, then 
J af d/j, = a J f d/j,. So to prove linearity we need only show that if / and g are 
simple functions as above then J{f + g) d/i — J f d/i + J g d/i. But this follows 
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because 

J2 ^'MCij) + 



n m m 



1=1 j=l j=l i=l 

n in 

i=i j=i 
^ J f dii + J g d/i. 

Monotonicity follows from the fact that if / and g are simple functions with 
f{x) < g{x), then g{x) — f{x) is a non- negative simple function. Clearly from the 
definition of the integral of a simple function, if the function is non-negative, then its 
integral is > 0. Thus J g dfi — J f d/j, ~ J g — f dfi > 0. 

If f{x) = ^rj^Aj, the absolute value property follows from the fact that 



□ 

Exercise 3.1.5. 

1. Prove that if / and g are simple functions, then so is fg. In particular, if 
E C [0, 1] is measurable then JXe is a simple function. 

A function / : [0, 1] — > R U {00} U {—00} will be called an extended real valued 
function. For a G M we will denote the set (—00, a] U {—00} by [—00, a] and the set 
[a, 00) U {00} by [a, 00]. 

Proposition 3.1.6. // / : [0,1] M is an eoctended real valued function, then the 
following are equivalent: 

1. For any a e [—00, 00] the set /~-'^([— 00, a]) is Lebesgue measurable. 

2. For any a e [—00,00] the set /~^([— oo,a)) is Lebesgue measurable. 
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3. For any a G [—oo, oo] the set f ^{[a, oo]) is Lehesgue measurable. 

4- For any a G [—oo, oo] the set /~^((a, oo]) is Lehesgue measurable. 

Proof. We will show 1) ^ 2) ^ 3) ^ 4) ^ 1). 

First assume 1), then [—oo,a[= \J^^i[—oo,a — 2^"]. So 

oo 

r\[-^.a[) = [jr\[-^,a-2-]) 

n=l 

which is measurable by Theorem ( lA.3.8p . Hence 2) holds. 
Now assume 2), then [a, oo] = [—oo, a)'^ so 

f-\[a, oo]) = f-\[-^,a)'^) = if-\[-oo,a)))^. 

Hence 3) holds. 

Assume 3), (a, oo] = |J^]^[a + 2"", oo]. So 

oo 

/-^((a,oo]) = U /"'([« -2-", oo]) 

n=l 

which is measurable by Theorem flA.3.8p . Hence 4) holds. 
Finally, assume 4), then [— oo,a] = (a, oo]^ so 

f-\[-oo,a]) = r\ia,oo]^) = (/" ^((a, oo])r. 

Hence 1) holds. □ 

Definition 3.1.7 (Measurable Function). An extended real valued function f is called 
Lebesgue measurable if it satisfies one (and hence all) of the properties of Proposition 

mm- 

Proposition 3.1.8. // f{x) is a function which has the value except on a set of 
measure 0, then f{x) is measurable. 

Proof. Suppose f{x) = for all x ^ A where A C [0, 1] has measure 0. That is, 
if A = /~i([-oo,0[) U /"^((O, oo]) then A is a null set. For a < the set Ua = 
/~^([— oo,a]) is a subset of A so Ua is a null set and hence measurable. For a > 
the set Ua = /~^([— oo,a]) is the complement of the null set /~^((a, oo]) and hence 
measurable. In either case Ua is measurable so / is a measurable function. □ 
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Theorem 3.1.9. Let be a sequence of measurable functions. Then the ex- 

tended real valued functions 

gi{x) = supfn{x) 

neN 

g2{x) = inf /„(x) 

neN 

5(3 (x) = limsup/„(x) 

n^oo 

g^ix) = liminf /„(x) 

n~>oo 

are all measurable. 

Proof. If a G [—00, 00], then 



{x I giix) > a} = |J{a; I /„(a;) > a}. 



n=l 



Each of the sets on the right is measurable so {x \ gi{x) > a} is also by Theorem 
(lA.3.8p . Hence gi is measurable. 

Since g2{x) = inf„gN fn{x) = — sup„gpj —fn{x) it follows that (72 is also measurable. 

Since the limit of a decreasing sequence is the inf of the terms, 

g-i{x) = limsup/„(a;) = inf sup /„(a;). 

n^oo meN Yi>m 

It follows that g^ is measurable. And since 

gi{x) = liminf /„(x) = - limsup -/„(x) 

^ n — >oo 

it follows that (74 is measurable. □ 

For the following result we need to use honest real valued functions, i.e., not 
extended. The reason for this is that there is no way to define the sum of two 
extended real valued functions if one has the value +00 at a point and the other has 
the value —00 at the same point. 



Theorem 3.1.10. The set of Lebesgue measurable functions from [0, 1] to W is a 
vector space. The set of bounded Lebesgue measurable functions is a vector subspace. 
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Proof. It is immediate from the definition that for c G M the function cf is measurable 
when / is. Suppose / and g are measurable. We need to show that f ~\- g is also 
measurable, i.e., that for any a G M the set Ua = {x \ f{x) +g{x) > a} is measurable. 

Let {rn}'^=i be an enumeration of the rationals. If xq G Ua, i.e., if /{xq) + g{xQ) > 
a, then f{xo) > a — g{xo). Since the rationals are dense there is an rm such that 
f{xo) > rm > a — g{xo). Hence if we define 

Vm = {x \ f{x) > r-m} n {x I g{x) > a- rm} 

then Xq G Vm- So every point of Ua is in some Vm- Conversely if Uq G Vm ior some m, 
then f{yo) > rm > a - g{yo), so f{yo) + g{yo) > a and yo G Ua- Thus Ua = \Jm=i ^rn 
and since each Vm is measurable, we conclude that [/„ is measurable. This shows 
that f -\-g is Sk measurable function and hence the measurable functions form a vector 
space. 

Clearly if / and g are bounded measurable functions and c G M then cf and f + g 
are bounded. We just showed they are also measurable, so the bounded measurable 
functions are a vector subspace. □ 

Exercise 3.1.11. 

1. Prove that if / is a measurable function, then so is 

2. Prove that if / and g are measurable functions, then so is fg. Hint: 2fg = 

if + gf -P- g'- 



3.2 The Lebesgue Integral of Bounded Functions 

In this section we want to define the Lebesgue integral and characterize the bounded 
integrable functions. In the case of the regulated integral, the integrable functions are 
the uniform limits of step functions. In the case of the Riemann integral a function / 
is integrable if the infimum of the integrals of step function bigger than / equals the 
supremum of the integrals of step function less than /. It is natural to alter both these 
definitions, replacing step function with simple function. It turns out that when we 
do this for bounded functions we get the same class of integrable functions whether 
we use the analog of regulated integral or the analog of Riemann integral. Moreover, 
this class is precisely the bounded measurable functions! 

Theorem 3.2.1. // f : [0,1] ^ R is a bounded function, then the following are 
equivalent: 
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1. The function f is Lebesgue measurable. 

2. There is a sequence of simple functions {/n}^i which converges uniformly to 



3. If Ufj,{f ) denotes the set of all simple functions u{x) such that f{x) < u{x) 
for all X and if Cij_{f) denotes the set of all simple functions v{x) such that 
v{x) < f{x) for all X,, then 



Proof. We will show 1) ^ 2) ^ 3) ^ 1). To show 1) =^ 2), assume / is a bounded 
measurable function, say a < f{x) < b for all x G [0, 1]. 

Let e„ = (b — a)/n. We will partition the range [a,b] of / by intervals follows: 
Let Ci = a + ien so a = Co < ci <■■■< On = b. Now define a measurable partition 
of [0,1] by Ai = /-^([ci_i,Ci)) for i < n and A^ = /~^([c„_i, 6]). Then clearly 
fn{x) = c-iXa, is a simple function. Moreover we note that for any x e [0, 1] we 
have \ f{x) — fn{x)\ < Cn- This is because x must lie in one of the A's, say x G Aj. So 
fn{.x) = Cj and f{x) G [cj^i,Cj[. Hence \ f{x) — /n(.x)| < Cj — Cj_i = e„. This implies 
that the sequence of simple functions {fn}'^=i converges uniformly to /. 

To show 2) =^ 3), assume / is the uniform limit of the sequence of simple functions 
{/n}^i- This means if Sn = sup^jg^o,!] \f{x)~fn{x)\, then lim(5„ = 0. We define simple 

functions Vn{x) = - Sn and Un{x) = fn{x) + Sn so Vn{x) < f{x) < Un{x). 
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Then 



inf < I u dn \ < lim inf / u„ dii 

^(^UM) ^ J J "^"^ J 

= lim inf / (/„ + 5„) dn 

n— >oo / 

= lim inf / /„ dfj, 

n— >oo / 

< lim sup / /„ dfx 

n— +00 J 

= lim sup / (/„ - 5n) dfi 

n—>oo J 

< lim sup / Vn dfj, 



< sup { I V dfi \. (3.2.1) 



For any v G C^{f) and any u G U^{f) we have f v d^ < f u dfi so 
sup I V da> < inf < u da}. 

Combining this with the inequality above we conclude that 

sup < V da} = inf < u da>. (3.2.2) 

All that remains is to show that 3) =^ 1). For this we note that if 3) holds, then 
for any n > there are simple functions w„ and Un such that Vn{x) < f{x) < u„(x) 
for all X and such that 

Undfi- I Vn dfi < 2"". (3.2.3) 



By Theorem (13.1.91) the functions 

gi{x) = sup \ Vn{x) \ and g2{x) = inf \ Un{x) \ 

L J neN I. J 



are measurable. They are also bounded and satisfy gi{x) < f{x) < g2{x)- We want to 
show that gi{x) = g2{x) except on a set of measure zero, which we do by contradiction. 
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Let B = {x \ gi{x) < g2{x)} and suppose yu(-B) > 0. Then since B = Ui^i where 
Bm = {x I gi{x) < g2{x) — ^} we conclude that fi{Bmo) > for some niQ. This implies 
that for every n and every x G Bmo we have < gi{x) < g2{x) — < Un{x) — 

So Un{x) - Vn{x) > ^ for all X e Bmo and hence - Vn{x) > ^Xb„^(x) for all 

X. But this would mean that J Un dfi — J Vn dfi = J Un — Vn dfi > J ^^s^g dfj, = 
■;^fj,{Bmo) for all n which contradicts equation (13.2.31) above. 

Hence it must be the case that fi{B) = so gi{x) = g2{x) except on a set of 
measure zero. But since gi{x) < f{x) < g2{x) this means if we define h{x) = 
f{x) —gi{x), then h{x) is zero except on a subset of B which is a set of measure 0. It, 
then follows from Proposition fl3.1.8p that h is a measurable function. Consequently, 
f{x) = gi{x) + h{x) is also measurable and we have completed the proof that 3) =^ 
1). □ 

Definition 3.2.2 (Lebesgue Integral of a bounded function). // / : [0,1] ^ M. is a 
bounded measurable function, then we define its Lebesgue integral by 

or equivalently (by Theorem h3.2.1\) ). 

/f dfi = sup < V dfi>. 

Alternatively, as the following proposition shows, we could have defined it to be 
the limit of the integrals of a sequence of simple functions converging uniformly to /. 

Proposition 3.2.3. // {g-n}'^ is any sequence of simple functions converging uni- 
formly to a bounded measurable function f , then lim^^oo / gn d/i exists and is equal 
to f f dfi. 

Proof. If we let 5„ = sup^j^jQ ^ |/(a;) — gn{x)\, then lim5„ = and 

gn{x) -6n< f{x) < gn{x) + 5n. 
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So Qn-Sn^ ^iiif) and Qn + Sn^ ^M)' Heiice 



f dii = inf u da 



< liminf / {g^ + 5„) dn 

n—*oo J 

= liminf / (?„ d/x 



< limsup / Qn dfi 

n— >oo J 

< limsup / (gn - 5n) d/j 

n— >oo J 



< sup 



{J V dfx^ 



f d/j,. 

Hence these inequalities must be equalities and lim„_,oo J Qn dfi = J f dfi. □ 
Exercise 3.2.4. 

1. Since simple functions are themselves bounded measurable functions, we have 
actually given two definitions of their Lebesgue integral: the one in Definition 
fl3.1.3l) and the one above in Definition (I3.2.2p . Prove that these definitions give 
the same value. 

Theorem 3.2.5. The Lebesgue integral, defined on the vector space of bounded Lebesgue 
measurable functions on [0,1], satisfies the following properties: 

I. Linearity: /// and g are Lebesgue measurable functions and Ci,C2 G M, then 

cif + C2g dfi = ci / f dfi + 02 / g d^. 



II. Monotonicity: // / and g are Lebesgue measurable and f{x) < g{x) for all x, 

then j f d^ < j g dfi. 

III. Absolute value: // / is Lebesgue measurable then \ f\ is also and \ J f dfi\ < 

/I/I dfi. 
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IV. Null Sets: /// and g are hounded functions and f{x) = g{x) except on a set of 
measure zero, then f is measurable if and only if g is measurable. If they are 
measurable, then J f dfi = J g dfi. 

Proof. If / and g are measurable there exist sequences of simple functions {/n}^i 
and {gn}'^=i converging uniformly to / and g respectively. This implies that the 
sequence {ci/„ + C2gn}'^=i converges uniformly to the bounded measurable function 
Cif + C2g. The fact that 



cif + C2g djj 



lim / (ci/„ + C25'„) dfx 
Ci lim fn dfi + 02 lim 

n— >oo / n— >oo 

ci f dfi + 02 g dii 



gn dfX 



implies the linearity property. 

Similarly the absolute value property follows from Lemma (13.1.4^ because 



/ df^ 



lim 

n^oo 



fn dfl 



< lim 

n—*oo 



\fn\ dfi= I/I dfi 



To show monotonicity we use the definition of the Lebesgue integral. If f{x) < 
g{x), then 

J f dfi = sup J ^ '^^} — ■) y u dfi = ^ J g (iyu|. 

If / and g are bounded functions which are equal except on a set E with /i(-E) = 0, 
then h{x) = f{x) — g{x) is non-zero only on the set E. The function h is measurable 
by Proposition (13.1.81) . Clearly, since f = g + h the function / is measurable if and 
only if g is. 

In case they are both measurable \ J f dfi — J g dfi\ = \ J h d/i] < J \h\ dfi. But the 
function h is bounded; say |/i(a;)| < M. Then |/i(a;)| < MXe{x) so by monotonicity 
/ \h\ dn < J MXe dn = Mfi{E) = 0. It follows that \ J f d^i - J g dn\ = so 
f f dfi = f g djj,. □ 

Definition 3.2.6. If E G [0, 1] is a measurable set and f is a bounded measurable 
function we define the Lebesgue integral of f over E by 



f d^= I fXE dii. 
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Proposition 3.2.7 (Additivity) . If E and F are disjoint measurable subsets o/[0, 1], 
then 

f dii= i f dii+ i f dfi. 



JEUF JE JF 

Proof. If E and F are disjoint measurable subsets of [0, 1], then Xeuf = 3£b + jCfS0 
/ f dji^ I fXEuF dji^ I f{XE + Xf) dji^ I f dji^ I f dji. 

JeuF J J JE JF 

□ 

Proposition 3.2.8 (Riemann integrable functions are Lebesgue integrable). Every 
hounded Riemann integrable function / : [0, 1] — M is measurable and hence Lebesgue 
integrable. The values of the Riemann and Lebesgue integrals coincide. 

Proof. The set U{f) of step functions greater than / is a subset of the set of 
simple functions greater than /. Likewise the set C{f) C C^{f). Hence 

sup < / v(t) dt>< sup \ V da> < inf < u da} < ini < u(t) dt >. 
v&df) l- Jo J v&cM) J ^ "^"''(^) J ^ ^0 J 

The fact that / is Riemann integrable asserts the first and last of these values are 
equal. Hence they are all equal and / is measurable and the Riemann and Lebesgue 
integrals coincide. □ 



3.3 The Bounded Convergence Theorem 

We want to investigate when the fact that a sequence of functions {/n}^i converges 
pointwisc to a function / implies that their Lebesgue integrals converge to the integral 
of /. It is straightforward to prove that if a sequence of bounded measurable functions 
converges uniformly to /, then their integrals converge to the integral of /. We will 
not do this, because we prove a stronger result below. But first we consider an 
example which shows what can go wrong. 

Example 3.3.1. Let 

n, a X e \-, -1; 
0, otherwise. 



fnix) 



3.3. THE BOUNDED CONVERGENCE THEOREM 



59 



Then /„ is a step function equal to n on an interval of length - and elsewhere. Thus 
J fn dfi = n^^ = 1. But, for any x G [0, 1] we have fn{,x) = for all sufficiently large 



n. 



Thus the sequence {/n}^i converges pointwise to the constant function 0. Hence 



lim fn{x)) dfi = and 



lim 

n— >oo 



fn dfl = 1. 



In this example each /„ is a bounded step function, but there is no single bound 
which works for all /„ since the maximum value of /„ is n. It turns out that any 
example of this sort must be a sequence of functions which is not uniformly bounded. 

Theorem 3.3.2 (The Bounded Convergence Theorem). Suppose {fn}'^=i is a se- 
quence of measurable functions which converges pointwise to a function f and there 
is a constant M > such that \fn{,x)\ < M for all n and all x G [0, 1]. Then f is a 
bounded measurable function and 



lim 

n— »oo 



fn dn= f dfi. 



Proof. For each x G [0,1] we know that lim^^oo /m(a^) = f{x) 
\f{x)\ < M and by Theorem fl3.1.9p that f{x) is measurable. 
We must show that 



This implies that 



lim 

n— >oo 



fn djj- / f dfx 



0. 



but 



lim 

n— >oo 



fn dfi- / f dfi 



lim 

n—*oo 



[ {fn -f)dfi < hm / \fn - f\ dfi. (3.3.1) 



So we need to estimate the integral of |/„ — / |. 

Given e > define En = {x \ \fm{,x) — f{x)\ < e/2 for all m > n}. Notice that if 
for some n the set En were all of [0, 1] we would be able to estimate J \ fm — f\ dfi < 
J e/2 dfi = e/2 for all m > n. But we don't know that. Instead we know that for any 
X the limit limm^oo fm{x) = f{x) which means that each x is in some En (where n 
depends on x). In other words IJ^i = [0, 1]. 

Since En C En+i by Proposition (12.4.61) we know lim„^oo Ai(-E'n) = /^([0, 1]) = 1. 
Thus there is an hq such that fi{Eno) > 1 — ttt, so fi{E^ ) < 
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Now for any n > uq we have 

/|/„-/|rf/i=/ \fn-f\dfX+ [ \fn-f\dfX 



< ^ dfi+ 2M dfi 

< - + 2M— = e. 

- 2 AM 



Thus we have shown hm^^oo / l/n " /I dfj, = 0. Putting this together with equation 
( I3.3.1I) we see that 



hm 

n— >oo 



fn dfi- J f dfl =0 

as desired. □ 



Definition 3.3.3 (Almost everywhere). // a property holds for all x except for a set 
of measure zero, we say that it holds almost everywhere or for almost all values of x. 

For example, we say that two functions / and g defined on [0, 1] are equal almost 
everywhere if the set of x with f{x) ^ g{x) has measure zero. The last part of Theorem 
(13.2.51) asserted that if f{x) = g{x) almost everywhere, then f f dfi = f g dfi. As 
another example, we say lim^j^oo fn{x) = f{x) for almost all x if the set of x where 
the limit does not exist or is not equal to f{x) is a set of measure zero. 

Theorem 3.3.4 (Better Bounded Convergence Theorem). Suppose {fn}'^=i is a se- 
quence of bounded measurable functions and f is a bounded function such that 

lim fnix) = f{x) 

for almost all x. Suppose also there is a constant M > such that for each n > 0, 
\fn{x)\ < M for almost all x G [0, 1]. Then f is a measurable function, satisfying 
\f{x)\ < M for almost all x G [0, 1] and 

lim fn diJ,= If dii. 



n^oo 
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Proof. Let A = {x \ lim„^oo /n(a;) 7^ f{^)}y then then fi{A) = 0. Define the set 
Dn = {x\ \ fn{x)\ > M}, then then fx{D„) = so if E = v4uU^^^ then = 0. 
Let 



Then 1(7^(3^)1 < for all x G [0,1] and for any x ^ E we have hm„^oo fi'n(a^) = 
hm„^oo /n(a^) = Also for x E E, gn{x) = so so for all x G [0,1] we have 

\imn^^ gn{x) = f{x)XEc{x). 

Define the function g by g{x) = f{x)XE'^{x). For any x ^ E we have g{x) = 
lim„^oo 5'n(a;) = lim„^oo /n(a^) = f{x) so g{x) = f{x) almost everywhere. We know 
from its definition that \g{x)\ < M since |5'ri(a;)| < M. And by Theorem (13.1.91) 
g is measurable. Since f{x) — g{x) is zero almost everywhere it is measurable by 
Proposition fl3.1.8p . It follows that / is measurable and by Theorem fl3.2.5l) 





Since /„ = almost everywhere we also know that 




Hence it will suffice to show that 



n- 




But this is true by Theorem (13.3.21) 



□ 
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Chapter 4 

The Integral of Unbounded 
Functions 

In this section we wish to define and investigate the Lebesgue integral of functions 
which are not necessarily bounded and even extended real valued functions. In fact, 
henceforth we will use the term "measurable function" to refer to extended real valued 
measurable functions. If a function is unbounded both above and below it is more 
complicated than if it is only unbounded above. Hence we first focus our attention 
on this case. 

4.1 Non-negative Functions 

Definition 4.1.1 (Integrable Function). /// : [0, 1] — > M Z5 a non-negative Lebesgue 
measurable function we let fn{x) — min{/(x),n}. Then fn is a bounded measurable 
function and we define 




If J f d/j, < oo we say f is integrable. 

Notice that the sequence {/ /„ dfi}'^^^ is a monotonic increasing sequence of num- 
bers so the limit lim„_»oo / fn d/i either exists or is +00. 

Proposition 4.1.2. If f is a non-negative integrable function and A — {x | f{x) — 
+00}; then IJ>{A) — 0. 
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Proof. For x & A wc observe that fn{x) = n and hence fn{x) > nXA{x) for all x. 
Thus f fn dfi > f hXa d/j, = niJ,{A). If IJ,{A) > 0, then J f d/j, = lim J fn d^ > 
limnfi{A) = +oo. □ 

Example 4.1.3. Let f{x) = 1/^ for x G (0,1] and let /(O) = +oo. Then / : 
[0, 1] — >■ M is a non-negative measurable function. Then the function 



Hence if 




= + (2 - -) 



— + 2-- = 2--. 

n-^ n n 



f dji^ hm / fndjji^ 2. 



Hence 



So / is integrable. 

Proposition 4.1.4. Suppose f and g are non-negative measurable functions with 
g{x) < f{x) for almost all x. If f is integrable, then g is integrable and J g d/i < 
J f d/i. In particular if g — almost everywhere, then J g d/i — 0. 

Proof. If fnix) — mm{ f{x),n} and fi'n(x) = mm{g{x),n}, then /„ and gi„ are bounded 
measurable functions and satisfy gn{x) < fn{x) for almost all x. It follows that 
J gn dfi < J fn dfi < j f dfi. Since the sequence of numbers {J gn dfi}'^^i is monotonic 
increasing and bounded above by J f dfi it has a finite limit. By definition this limit 
is J g d/i. Since for each n we have J gn dfj. < J f d/i, the limit is also bounded by 
J f dji. That is, 

g dn,= lim gn dfx < If dfx. 

11 g — Q almost everywhere, then Q < g{x) < for almost all x so we have J g dji = 
0. □ 
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Corollary 4.1.5. /// : [0, 1] — s> M is a non-negative integrable function and J f d/i — 
then f{x) =0 for almost all x. 

Proof Let En^{x\ f{x) > 1/n}. Then f{x) > ^Xe„{x) so 

-H{E„) = / -Xe^ dn< / / d/x = 0. 
n J n J 

Hence /^(K) = 0. But ii E = {x \ f{x) > 0}, then E = U^=i K so i^{E) = 0. □ 

Theorem 4.1.6 (Absolute Continuity). Suppose f is a non-negative integrable func- 
tion. Then for any e > there exists a S > such that J^f d/i < e for every 
measurable A C [0, 1] with IJ>{A) < 6. 

Proof. Let = mm{f{x),n} so lim J fn d/i — J f d/i. Let 

Er, = {xe [0, 1] I fix) > n} 



so 



fn{x) 



n, if a; G E^, 
fix), ifxeE^. 



Consequently 



Hence we have 



fndii= n dfi+ f d/i. 
J Je„ Je^ 



f d/i^ f diJ,+ f d/i 
J Je„ Jec 

— if — n) djjt, -\- I n dijL-\- f dji 

J En J En J E^ 

= / if -n) dii+ fn d/i. 

J En ^ 

Thus J f d/i — J fn d/i — if — n) d/i and we conclude from integrability of / that 

lim / if — n) djj, — 0. 

J En 
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Hence we may choose N such that J^^if — N) dfi < e/2. Now pick 6 < e/2N. 
Then if fi{A) < 6 we have 

f dfi= f dfi+ f dfi 



< [ {f -N) dfi+ [ N dfi+ [ N dfi 

J AhEm J AhEm JAnE'ff 

[ if-N) dfi+ [ Ndfi 

J Eat J a 



< 

' EjM 

< I + Nfi{A) <^ + N6 <e 



□ 



Theorem (I4.1.6P is labeled "Absolute Continuity" for reasons that will become 
clear later in Section §4.3[ But as a nearly immediate consequence we have the 
following generalization of a result from Exercise (11 .4.31) . 

Corollary 4.1.7 (Continuity of the Integral). // / : [0, 1] — M zs a non-negative 
integrable function and we define F{x) = J^^ / dfi, then F{x) is continuous. 

Proof. Given e > let 5 > be the corresponding value guaranteed by Theorem 
(I4.1.6p . Now suppose x < y and \y — x\ < 6. Then fi{[x,y]) < 6 so 



\F{y) - F{, 



f dfi — I f dfi 

[0,y] J[0,x] 



f dfi 

'^,y] 



< e 



by Theorem (14. 1.61) . We have in fact proven that F is uniformly continuous. □ 
Exercise 4.1.8. 

1. Define f{x) = ^ for x G (0, 1] and /(O) = +oo. Prove that / is integrable if 
and only if p < 1. Calculate the value of j f dfi in this case. 

2. Give an example of a non- negative extended function g : [0, 1] —>■ W which is 
integrable and which has the value +oo at infinitely many points of [0, 1]. 
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4.2 Convergence Theorems 

The following result is very similar to the Bounded Convergence Theorem (see Theo- 
rem (I3.3.2P and Theorem (I3.3.4p . The difference is that instead of having a constant 
bound on the functions fn we have them bounded by an integrable function g. This 
is enough to make essentially the same proof work, however, because of Theorem 

( Km . 

Theorem 4.2.1 (Lebesgue Convergence for Non-negative functions). Suppose fn is 
a sequence of non-negative measurable functions and g is a non-negative integrable 
function such that fn{x) < g{x) for all n and almost all x. If lim fn{x) = f{x) for 
almost all x, then f is integrable and 



f d/j, 



lim 

ra— >oo 



fn d/J,. 



Proof. If we let hn = fn^E and h = fXs where E = {x \ lim/„(x) = f{x)}, then 
f = h almost everywhere and /„ = hn almost everywhere. So it suffices to prove 



h dfi 



lim 

n— »oo 



hn dn, 



and we now have the stronger property that limhn{x) = h{x) for all x, instead 
of almost all. Since /i„(x) = fn{x)XE{x) < g{x) for almost all x we know that 
h{x) < g{x) for almost all x and hence by Proposition f l4.1.4p that h is integrable. 

The remainder of the proof is very similar to the proof of Theorem (I3.3.2p . We 
must show that 



lim 

n— >c» 



hn dfi — / h dfi 



0. 



but 



lim 

ra— >oo 



hn dfi — I h dfi 



lim 

ra— >oo 



{hn -h) dfi < lim / \hn - h\ dfi. (4.2.1) 



So we need to estimate the integral of — 

Given e > define En = {x \ \hm{x) — h{x)\ < e/2 for all m > n}. We know by 
Theorem (14.1.61) that there is a 5 > such that J^g dfi < e/A whenever fi{A) < 6. 

We also know that for any x the limit limm^oo hm{x) = h{x) which means that 
each X is in some En (where n depends on x). In other words IJ^i = [0, 1]. Since 
En C En+i by Proposition fl2.4.6p we know lim„_+oo /^(-^ra) = yu([0, 1]) = 1. Thus there 
is an uq such that fi{Eno) > 1 — 5, so fi{E^^) < 6. 
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Now \hn{x) — h{x)\ < \hn{x) \ + \h{x)\ < 2g{x) so for any n > no we have 



\hn — h\ dfi = / \hn — h\ dfi + / \hn — h\ 
< I d^ + 2g d^ 



< ^fi{E^,) + 2 [ gdfi 



4 + 2i = ^- 



Thus we have shown hm / \hn — h\ dfi = 0. Putting this together with equation 

n— >c * 

fHXTD we see that 



hm 

n— >oo 



hn dfi — / h dfi 







as desired. 



□ 



Theorem 4.2.2 (Monotone Convergence Theorem). Suppose Qn is an increasing 
sequence of non-negative measurable functions. Iflimgn{x) = f{x) for almost all x, 
then 

f di2= hm / gn d^. 



In particular f is integrable if and only z/hm j g^ d^ < +oo. 



Proof. The function / is measurable by Theorem (13.1.91) . If it is integrable, then the 
fact that f{x) > gn{x) for almost all x allows us to apply the previous theorem to 
conclude the desired result. 

Hence we need only show that if J f d^ = +oo, then lim / g^ djj, = +oo. But if 

j f dfi = +00, then for any > there exists no such that J min{/(x), no} dfi > N. 
And we know that 

lim mm{gn{x),no} = min{/(x), no} 
for almost all x. Since these are bounded measurable functions, 

lim / gn dfi > lim / min{(7„,no} dfi = min{/, n-o} dfi > N, 

n— »oo / n— »oo / / 
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where the equahty comes from the bounded convergence Theorem (13.3.41) . Since 
is arbitrary we conclude that 



hm / Qn dfi = +00. 

□ 

Corollary 4.2.3 (Integral of infinite series). Suppose Un is a non-negative measurable 
function and f is an non-negative function such that Yl'^=i'^n{x) = f{x) for almost 
all X. Then 



f dfi = ^ / Un dn- 

n=l ^ 



Proof. Define 

N 



/iv(x) = ^M„(x). 



n=l 



Now the result follows from the previous theorem. □ 

4.3 Other Measures 

There are other measures besides Lebesgue and indeed measures on other spaces 
besides [0, 1] or R. We will limit our attention to measures defined on J = [0, 1]. 

Recall that a collection A of subsets of / is called a a-algebra provided it contains 
the set / and is closed under taking complements, countable unions, and countable 
intersections. 

Examples 4.3.1. The following are examples of a-algebras on / = [0, 1]: 

1. The trivial a -algebra. A= {0,/}. 

2. A= {A G I \ A is countable, or A'^ is countable}. 

3. A = ^A the Lebesgue measurable sets 

4. A is Borel sets, the smallest a-algebra containing the open intervals. 

Definition 4.3.2 (Finite Measure). If A is a a-algebra of subsets of I, then a function 
u : A ^ M. is called a finite measure provided 
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• ^(^) ^ for every A & A, 

• 1/(0) = 0, < oo, and 

• v is countably additive, i.e. i/{^n}$^i o^re pairwise disjoint sets in A, then 

oo oo 

n=l n=l 



We will restrict our attention to measures defined on the o"-algebra of Lebesgue 
measurable sets. The integral of a measurable function with respect to a measure v 
is defined analogously to Lebesgue measure. 

Definition 4.3.3. Let u be a finite measure defined on the a -algebra M.{I). If 
/(^) — Yll=i ^i^Ai is a simple function then its integral with respect to u is de- 
fined by J f du = ^"=1 If g : [0,1] M is a bounded measurable function, 
then we define its integral with respect to v by 

/q dv — inf { l u dv\. 
ueUM I- J J 

If h is a non-negative extended measurable function we define 




mm{h, n} du. 



Definition 4.3.4 (Absolutely Continuous Measure). If u is a measure defined on 
M.{I), the Lebesgue measurable subsets of I,, then we say v is absolutely continuous 
with respect to Lebesgue measure /i if IJ>{A) — implies v{A) — 0. 

The following result motivates the name "absolute continuity." 

Theorem 4.3.5. If v is a measure defined on A4{I) which is absolutely continuous 
with respect to Lebesgue measure, then for any e > there is a S > such that 
viA) < e whenever n{A) — 5. 

Proof. We assume there is a counter-example and show this leads to a contradiction. 
If the measure u does not satisfy the conclusion of the theorem, then there is an e > 
for which it fails, i.e. there is no 5 > which works for this e. In particular, for any 
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positive integer m there is a set Bm such that v{Bm) > e and fi{Bm) < 1/2™. Hence 
if we define A„ = Um=n+i ^rn, then 

oo "'^ 1 1 

f.{An) < /^(^-) < E ^ = ^ 

m=n+l m=ra+l 

The sets y4„ are nested, i.e. y4„ D It follows from Proposition (12.4.61) that 

oo ^ 

/i(n A„) = lim /i(A„) < hm — = 0. (4.3.1) 

' ' n— »oo n— »oo Z 
n=l 

The proof of Proposition (I2.4.6P made use only of the countable additivity of the 
measure. Hence it is also valid for z/, i.e. 

oo 

1/(0 A„) = lim p{An). 

' < n— >oo 

n=l 

On the other hand z/(y4„) > z/(i?„+i) > e, so 

oo 

z/((| An) = lim iy{An) > lim e = e. 

' ' n^oo n^oo 

n=l 

This together with equation (14.3.11) contradicts the absolute continuity of u with 
respect to /i. We have proven the contrapositive of the result we desire. □ 

Exercise 4.3.6. Given a point xq G [0, 1] define the function ^^^^ : — > R by 
4o(A) = 1 if xo e A and 5^.o(A) = ii xq ^ A. Let z/(A) = 6^o{A). 

1. Prove that z/ is a measure. 

2. Prove that if / is a measurable function j f du = /(xq). 

3. Prove that u is not absolutely continuous with respect to Lebesgue measure fi. 
The measure u is called the Dirac 6-measure. 

Proposition 4.3.7. If f is a non-negative integrable function on I and we define 

iyf{A)= f dfi 

J A 

then Vf is a measure with a-algehra M.{I) which is absolutely continuous with respect 
to Lebesgue measure ^. 
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Proof. Clearly iyf{A) = j\^f dfi > for all A E Ai since / is non-negative. Also 
z/j(0) = 0. We need to check countable additivity. 

Suppose {An}'^^i is a sequence of pairwise disjoint measurable subsets of [0,1]. 
and A is their union. Then for all x G [0.1]. 



f{x)XA{x) = ^/(x)XA„(a;). 

n=l 

Hence by Theorem (14.2.31) 

/oo p 
/Xa dfx = Y fXA„ d^i 
n=l ^ 

and so 

oo 
n=l 

Thus 1/ is a measure. 

If = 0, then by Proposition (14.1.41) 



iyf{A) = j fXAdfi = 



so z/ is absolutely continuous with respect to /i. □ 

The converse to Proposition fl4.3.7p is called the Radon-Nikodym Theorem. Its 
proof is beyond the scope of this text. A proof can be found in Chapter 11 Section 5 
of Royden's book [1] 

Theorem 4.3.8 (Radon-Nikodym). If u is a measure with cr-algebra Ai{I) which is 
absolutely continuous with respect to Lebesgue measure /i, then there is a non-negative 
integrable function f on [0, 1] such that define 

iy{A) = f dfi. 

J A 

The function f is unique up to measure 0, i.e. if g is another function with these 
properties, then f = g almost everywhere. 

The function / is called the Radon-Nikodym derivative of u with respect to fi. 
In fact the Radon-Nikodym Theorem is more general than we have stated, since it 
applies to any two finite measures u and /i defined on a a-algebra A with u absolutely 
continuous with respect to /x. 
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4.4 General Measurable Functions 

In this section we consider extended measurable functions which may be unbounded 
both above and below, we will define 

f^{x) =max{/(x),0} and f~{x) = — min{/(a;), 0}. 

These are both non-negative measurable functions. 

Definition 4.4.1. // / : [0,1] ^ W is a measurable function, then we say f is 
Lebesgue integrable provided both and are integrable (as non-negative func- 
tions). If f is integrable we define 

f dfi = J f^ dn- j f- dfj.. 

Proposition 4.4.2. Suppose f and g are measurable functions on [0, 1] and f = g 

almost everywhere. Then if f is integrable, so is g and J f dfi = J g dfi. In particular 
"^f f = ^ almost everywhere j f dfi = 0. 

Proof. If / and g are measurable functions on [0, 1] and f = g almost everywhere, 
then = g^ almost everywhere, /~ = g~ almost everywhere, and f~^ and /~ are 
integrable. It, then follows from Proposition ( 14.1.4^ that g^ and g~ are integrable 
and that j f^ d^ > j g^ dfi and J f~ d^ > j g^ dfi. Switching the roles of / and 
g this same proposition gives the reverse inequalities so we have J f~^ d^ = f g'^ dfi 
and f f~ dfi = f g^ dfi. □ 

Proposition 4.4.3. The measurable function / : [0, 1] ^ M is integrable if and only 
if the the function \ f\ is integrable. 

Proof. Notice that = /^(x) + f~{x). Thus if |/| is integrable, since > 

f'^ix) and > f^{x) it follows from Proposition fl4.1.4l) that both /+ and 

are integrable. Conversely if f~^ and /~ are integrable then so is their sum |/|. □ 

Theorem 4.4.4 (Lebesgue Convergence Theorem). Suppose fn is a sequence of mea- 
surable functions and g is an non-negative integrable function such that \fn{x)\ < g{x) 
for all n and almost all x. If lim fn{x) = f{x) for almost all x, then f is integrable 
and 

f dfi= lim / /„ dfi. 



n—*oo 
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Proof. The functions fni^) = max{/„(x), 0} and (x) = — min{/„(a;), 0} satisfy 
lim = and lim /^(x) = /"(x) 

n—>oo n—foo 

for almost all x. Also g{x) > fni^) and g{x) > fni^) for almost all x. Hence by 
Theorem (K2A}\ 



djji = lim / djji and / / djj, = lim / /„ d^. 
Thus / is integrable and 

f dii= I dji- i r d^i 



lim / d^- lim / /„ rf/x 

n— >oo / n— »oo / 

lim / fn - fn df^ 

n— >oo / 

lim / /„ dn- 



□ 

The following theorem says that for any e > any integrable function can be 
approximated within e by a step function if we are allowed to exclude a set of measure 
e. 

Theorem 4.4.5. /// : [0, 1] — M an integrable function, then given e > there is 
a step function g : [0,1] M. and a measurable subset A C [0, 1] such that fi{A) < e 
and 

— g{x)\ < e for all x ^ A. 
Moreover, if < M for all x, then we may choose g with this same bound. 

Proof. We first prove the result for the special case of f{x) = Xe{x) for some mea- 
surable set E. This follows because there is a countable cover of E by open intervals 
{Ui}Zi such that 

OO 

fi{E)<Y,len{U,)<fi{E)+'-. 

i=l 
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and hence 

oo 

^^{{W\E)<'^■ (4.4.1) 

i=l 

Also we may choose > such that 

oo oo 
i=N i=N 

Let Vat = U^^^f/j. It is a finite union of intervals, so the function g{x) = Xy^ is a 
step function and if A = {x \ f{x) ^ g{x)}, then 

oo oo 

ag(v^\e')u (^e\Vn) c (^{\Ju,)\e^ u (U f/,), 

i=l i=N 

so it follows from equations (I4.4.ip and (I4.4.2p that fJ.{A) < e. This proves the result 
for / = Xe. 

From this the result follows for simple functions / = ^rjX^;^ because if gi is the 
approximating step function for X^;. then g = ^ r^gi approximates / (with a suitably 
adjusted e). 

If / is a bounded measurable function by Theorem (13.2.11) there is a simple function 
h such that |/(a;) — h{x)\ < e/2 for all x. Let g he a step function such that \h{x) — 
g{x)\ < e/2 for all x ^ A with fi{A) < e. Then 

\f{x) - g{x)\ < \f{x) - h{x)\ + \h{x) - g{x)\ < £ + | = e, 

for all X ^ A. That is, the result is true if / is a bounded measurable function. 

Suppose / is a non- negative integrable function. Let An = {x \ f{x) > n}. Then 

nn{An) = J nXA„ dfi < J f dfi< oo. 

It follows that limyu(y4„) = 0. Hence there is an > such that fi^A^) < e/2. 

If /at = min{/, A^}, then is a bounded measurable function. So we may choose 
a step function g such that \fN{x) — g{x)\ < e/2 for all a; ^ i? with /i(-B) < e/2. It 
follows that if y4 = Ajy U B, then ^{A) < e. Also if x ^ A, then /(x) = /iv(x) so 



\fix) - gix)\ < |/(x) - Mx)\ + \Mx) - gix)\ = |/^(x) - 
Hence the result holds for non-negative /. 



g{x)\ < e. 
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For a general integrable / we have f = — f~ ■ The fact that the result holds 
for and easily implies it holds for /. 

Suppose now that / is bounded, say |/(a;)| < M for all x and g satisfies the 
conclusion of our theorem, then we define 

{M, if g{x) > M; 
g{x), if - M<g{x)<M; 
-M if g{x) < -M. 

The function gi is a step function with < M and gi{x) — g{x) except when 

\g{x)\ > M. Note if g{x) > M and x ^ A then f{x) < M = gi{x) < g{x) so 
\gi{x) — f{x)\ < 6. The case g{x) < —M is similar. 

□ 

Theorem 4.4.6. The Lebesgue integral satisfies the following properties: 

I. Linearity: If f and g are Lebesgue measurable functions and Ci, C2 G M., then 

J cif + C2g dji^ci J f dii + C2 J g dji. 

II. Monotonicity: // / and g are Lebesgue measurable and f{x) < g{x) for all x, 

then J f d/j, < J g d/i. 

III. Absolute value: /// is Lebesgue measurable then \ f\ is also and \ J f d/i] < 

/I/I df^. 

IV. Null Sets: /// and g are bounded functions and f{x) — g{x) except on a set of 

measure zero, then f is measurable if and only if g is measurable. If they are 
measurable, then J f d/i — J g d/i. 

The proof is left as an exercise. 

Exercise 4.4.7. 

1. Prove that if /, g, h are measurable functions and f = g almost everywhere and 
g = h almost everywhere, then f = h almost everywhere. 

2. Prove that if / : [0, 1] — > R is an integrable function, then given e > there 
exists a continuous function : [0, 1] ^ M and a set A such that i-i{A) < 
e, \f{x) - g{x)\ < e for all x ^ A, and g{0) = g{l). 
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3. Prove that the, not necessarily bounded, integrable functions from [0, 1] to M 
form a vector space. 

4. Prove Theorem fl4.4.6l) . Proposition f l4.4.2p proves the null set property. Prove 
the remaining parts of this theorem, namely linearity, monotonicity, and the 
absolute value property. (You may use Theorem (13.2.51) ). 
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Chapter 5 

The Hilbert Space L'^[-1, 1] 



5.1 Square Integrable Functions 

In this chapter we will develop the beginnings of a theory of function spaces with many 
properties analogous to the basic properties of M". To motivate these developments 
we first take a look at M" in a different way. We let X be a finite set with n elements, 
say, X = {l,2,3,...,n} and we define a measure u on X which is called the "counting 
measure" . 

More precisely, we take as o"-algebra the family of all subsets of X and for any 
A G X we define i'{A) to be the number of elements in the set A. It is easy to 
see that this is a measure and that any function / : X — » R is measurable. In fact 
any function is a simple function. This is because there is a partition of X given by 
Ai — {i} and clearly / is constant on each Ai, so f — Y17=i '"'i^Ai where n = f{i). 

Consequently we have 



For reasons that will be clear below we will denote the collection of all functions 
from X to M by L^(X). The important thing to note is that this is just another name 
for W^. More formally, there is a vector space isomorphism of L'^{X) and M" given 
by / < — > (xi, a;2, . . . , Xn) where Xi = f{i). 

Under this isomorphism it is important to note what the inner product (or "dot" 
product < x,y >— Y17=i -"^iVi becomes, li f,g & LP'{X) are the functions correspond- 



n 



n 




1=1 1=1 
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ing to vectors x and y respectively, then Xi = f{i) and yi = g{i) so 



n 



n 



<x,y>= ^XiVi = ^f{{}g{{} 



fg dv. 



i=l i=l 



Also the norm (or length) of a vector is given by 



n 



n 




x\\'^ =< X, X >= 



i=l i=l 



It is this way of viewing the inner product and norm on M" which generalizes nicely 
to a space of real valued functions on the interval. 

In this chapter it will be convenient (for notational purposes) to consider functions 
on the interval [—1, 1] rather than [0, 1]. Of course, all of our results about measurable 
functions and their integrals remain valid on this different interval. 

Definition 5.1.1. A measurable function f : [—1, 1] — M zs called square integrable 
if f{x)^ is integrable. We denote the set of all square integrable functions by L^[—l, 1]. 
We define the norm of f E 1, 1] by 



Proposition 5.1.2. The norm \\ \\ on L^[— 1,1] satisfies \\af\\ = \a\\\f\\ for all a E M. 
and all / G 1, 1]. Moreover for all f, ||/|| > with equality only if f = almost 
everywhere. 

Proof. We see 




af\\ = {j a'p dixy = 




f dfi 



a\\\f\\. 



Since, J f^ dfi > clearly ||/|| > 0. Also if 




So by Corollary fl4.1.5p = almost everywhere and hence / = almost everywhere. 

□ 
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Lemma 5.1.3. // /, (7 G 1, 1], then fg is integrable and 

2 1 \fg\ df,<\\fr+\\gr. 

Equality holds if and only if \ f\ = \g\ almost everywhere. 
Proof. Since 

< - \gix)\f = f{xf - 2\f{x)g{x)\ + g{xf 

we have 2\f{x)g{x)\ < /(x)^ + g{xY. Hence by Proposition (14.1.41) we conclude that 
\fg\ is integrable and that 

2 j \fg\d^^<\\f\\'' + \\gf. 

Equality holds if and only if j {\f {x)\ — \g{x)\)'^ dji = Q and we may conclude by Corol- 
lary fl4.1.5p that this happens if and only if — [(^(x)!)^ = almost everywhere 
and hence that |/| = \g\ almost everywhere. □ 

Theorem 5.1.4. 1,1] is a vector space. 

Proof. We must show that if /, G 1, 1] and c G M, then cf G L'^[—l, 1] and 
if + 9) ^ -^^["l) !]• The first of these is clear since integrable implies that c^/^ is 
integrable. 

To check the second we observe that 

if + gY = f + 2fg + g'<f + 2\fg\+g'. 

Since g^ and \ fg\ are all integrable, it follows from Proposition (14.1.41) that {f+gY 
is also. Hence {f + g) e L'^[-l, 1]. □ 

Theorem 5.1.5 (Holder Inequality). If f,g G L^[— 1,1], then 

I \f9\ dfi<\\f\\ \\g\\. 

Equality holds if and only if there is a constant c such that |/(a;)| = c|(7(a;)| or \g{x) \ = 
c\f{x)\ almost everywhere. 
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Proof. If either ||/|| or ||5f|| is the resuh is trivial so assume they are both non-zero. 
In that case the functions /o = //||/|| and qq = g/\\g\\ satisfy ||/o|| = \\go\\ = 1. 
Then by Lemma fl5.1.3p 



2 J \fo9o\ ci/x < ll/of + ll^oir = 2, 



so 

j l/ofl-ol dfi < 1, 

and equality holds if and only if |/o| = \go\ almost everywhere. So 

Ifgl dfi= l/o^ol dfi<l 



\9\ 

and hence 

j \fg\ dfi < 

Equality holds if and only if |/o| = \go\ almost everywhere, which implies there is 
a constant c with = c|(7(x)| almost everywhere. □ 

Corollary 5.1.6. If f,g e L'^[-l, I], then 

fgdix\<\\f\\ \\g\\. 

Equality holds if and only if there is a constant c such that f{x) = cg{x) or g{x) = 
cf{x) almost everywhere. 

Proof. The inequality follows from Holder's inequality and the absolute value inequal- 
ity since 

J fgdfi\< j \fg\dfx<\\f\\ \\g\\. 

Equality holds when both of these inequalities are equalities. If this case, suppose 
first that f fg dfj, > 0. Then J \ fg\ dfi = f fg d^, so / \ fg\ — fg dfi = and hence 
\fg\ = fg almost everywhere. This says that / and g have the same sign almost 
everywhere. Since the second inequality is an equality we know from Holder that 
there is a constant c such that = c|5'(x)| or \g{x)\ = c\f{x) \ almost everywhere. 

This togther with the fact that / and g have the same sign almost everywhere implies 
f{x) = cg{x) or g{x) = cf{x) almost everywhere. For the case that that J fg d^ < 
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we can replace / with — / and conclude that f{x) = —cg{x) or g{x) = —cf{x). 
Conversely, it is easy to see that if /(,x) = cg{x) or g{x) = cf{x) almost everywhere, 
then the inequality above is an equality. 



The following result called the Minkowski Inequality, is the triangle inequality for 
the vector space 1, 1]. 

Theorem 5.1.7 (Minkowski's Inequality). ///, y G 1, 1], then 



Taking square roots of both sides of this equality gives the triangle inequality. □ 

Definition 5.1.8 (Inner Product on L^[— 1,1]). If f,g G L^[— 1, 1], then we define 
their inner product by 



Theorem 5.1.9 (The Inner Product on L^[—l, 1]). For any fi, f2,g G 1, 1] and 
any Ci, C2 G M the inner product on L'^[—l, 1] satisfies the following properties: 

1. Commutativity; {f,g)^{gj). 

2. Bi-linearity; (ci/i + C2/2, g) ^ Ci{fi,g) + C2{/2, g). 

3. Positive Definiteness; {g,g) = \\g\\^ > with equality if and only if g = almost 
everywhere. 



□ 



ll/ + ^ll< 11/11 + lb 



Proof. We observe that 




< ll/f + 2||/|| ll^ll + \\gf by H51der's inequality 

-m + \\9\\r- 
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Proof. Clearly {f,g) = J fg dfi = J gf dfi = {g, /). Bi-linearity holds because of the 
linearity of the integral. Also {g,g) = j g^ dfi > 0. Corollary (14.1.51) implies that 
equality holds only if g^ = almost everywhere. □ 

Notice that we have almost proved that 1, 1] is an inner product space. The 
one point where the definition is not quite satisfied is that ||/|| = implies / = 
almost everywhere rather than everywhere. The pedantic way to overcome this 
problem is to define L^[— 1,1] as the vector space of equivalence classes of square 
integrable functions, where / and g are considered "equivalent" if they are equal 
almost everywhere. It is customary, however, to overlook this infelicity and simply 
consider L^[— 1,1] as a vector space of functions rather than equivalence classes of 
functions. In doing this we should keep in mind that we are generally considering two 
functions the same if they agree almost everywhere. 

5.2 Convergence in 1, 1] 

We have discussed uniform convergence and pointwise convergence and now we wish 
to discuss convergence in the L^[— 1,1] norm || ||. The vector space 1,1] is, of 
course, a metric space with distance function given by dist(/, ^f) = ||/ — g\\. Note 
that dist (/,(?) = if and only if f = g almost everywhere, so again if we wish to 
be pedantic this metric space is really the equivalence classes of functions which are 
equal almost everywhere. 

Definition 5.2.1. If {fn}'^=i is a sequence in L^[— 1, 1], then it is said to converge to 
in measure of order 2 or to converge in 1, 1] if there is a function f G 1, 1] 
such that 

Mm = 0. 

n^oo 

Lemma 5.2.2 (Density of Bounded Functions). If we define 

{n, if fix) > n; 
fix), if - n< fix) < n; 
-n if fix) < -n, 

then 

\im = 0. 

n— >oo 
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Proof. We will show that for any e > there is an n such that ||/ — < e. First 
we note that |/n(a;)| < so 

- fn{x)\' < \fix)\' + 2|/(x)| + < 4|/(x)r. 

Let En = {x\ \f{x)\ >n} = {x\ > n^} and let C = / |/p d/x. Then 

C= f I/I' dfi> f Iff dii> I n'' dfi = n^fiiEn) 

J J En 'J En 

and we conclude that fJ.{En) < C /'n?. 

We know from absolute continuity, Theorem (14. 1.61) . that there is a 5 > such 
that l/p (i/i < e/4 whenever < 5. Thus we have 

\\f-fn\\'=f\f-fn\'d^i=[ \f-fn\'d^i<[ A\ f \' d^M < a'- = 6 

J J En ^ En 

whenever n is sufficiently large that fi{En) < C /n"^ < 6. □ 

Proposition 5.2.3 (Density of Step Functions and Continuous Functions). The step 
functions are dense in 1, 1]. That is, for any e > and any f G 1, 1] there is 
a step function g : [—1, 1] ^ M such that \\f — g\\ < e. Likewise, there is a continuous 
function h : [—1, 1] ^ M such that \\f — h\\ < e. The function h may be chosen so 
h{-l) = h{l). 

Proof. By the preceding result we may choose n so that || / — /n|| < f • Note that 
|/n(3;)| < n for all x. Suppose now that 6 is any given small positive number. Ac- 
cording to Theorem fl4.4.5p there is a step function g with \g\ < n and a measurable 
set A with fi{A) < 6 such that |/n(x) — g{x)\ < 6 if x ^ A. Hence 

= / \fn-g\^dfi+ / \fn-9?dll 
J A J A'' 

,2 J.. , / x2 



< An^ dfi+ 6^ dn 

J A J A'' 

< 4nV(^) + S^K^") < 4:n^5 + 25\ 



Clearly if we choose S sufficiently small, then 



\\fn-g\\<VAn^S + W<-. 
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It follows that 11/ - ^11 < 11/ - /nil + ||/„ - ^11 < e. 

The proof for continuous functions is the same, except Exercise (14.4. 7p is used in 
place of Theorem fl4.4.5p . The details are left as an exercise. □ 

Definition 5.2.4. An inner product space (V, ( , )) which is complete, i.e. in which 
Cauchy sequences converge, is called a Hilbert space. 

For example, with the usual dot product is a Hilbert space. 
We want to prove that 1, 1] is a Hilbert space. 

Theorem 5.2.5. 1,1] is a Hilbert space. 

Proof. We have already shown that L^[— 1,1] is an inner product space. All that 
remains is to prove that the norm || || is complete, i.e. that Cauchy sequences converge. 

Let {/n}^i be a Cauchy sequence. Then we may choose numbers Ui such that 
1 1 /m — /nil < 1/2* whenever m,n > Ui. Hence if we define go = and gi = fm for 
i > 0, then \\gi+i — gi\\ < 1/2* so, in particular \\gi+i — giW converges, say to S. 

Consider the function hn{x) defined by 

n-l 



hnix) = ^ \gi+lix) - g^{x)\. 



i=0 



For any fixed x the sequence {hn{x)} is monotone increasing so we may define the 
extended real valued function h by h{x) = lim hn{x). Note that by the Minkowski 

n— »oo 

inequality 



rt-l 



I /in II < 2^ hi+l - QiW < s. 
i=0 



Hence / /i^ d/x = ||/i„|p < S'^. Since hn{x)'^ is a monotonic increasing sequence of 
non-negative measurable functions converging to h"^ we conclude from the Monotone 

Convergence Theorem « that f = li.n / ft| < so is integrable. 

Since h"^ is integrable, h{x) is finite almost everywhere. For each x with finite 
h{x) the series of real numbers Yliloidi+'^i-^) ~ 9ii.^)) converges absolutely and hence 
converges by Theorem fl0.2.6p . We denote its sum by g{x). For x in the set of measure 
where h{x) = +oo we define g{x) = 0. Notice that 



n— 1 



9n{x) = ^{gi+i{x) - gi{x)) 



2=0 
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because it is a telescoping series. Hence 

n-l 

lim = lim y^(gi+i{x) - gi{x)) = g{x) 

n—foo n—>oo ' ' 

i=0 

for almost all x. Moreover 

n-l 

\g{x)\ = lim \gn{x)\ < lim y^ \gi+i{x) - giix)\ = lim hn{x) = h{x) 

«— >no n— »oo ' ' n—>oo 
j=0 



for almost all x so |5'(a;)p < h{xY and hence is integrable and g G 1, 1]. 

We also observe that 

\g{x) - gn{x)\^ < {\g{x)\ + \gn{x)\f < i2h{x)f. 

Since lim \g(x) — gn{x)\'^ = for almost all x the Lebesgue Convergence Theorem 

n~KX) 

(14.4.41) tells us lim / \g{x) — dfi = 0. This implies lim H^f — gn\\ = 0. 

?i— >oo / n^oo 

Hence given e > there is an i such that — SfiH < e/2 and 1/2* < e/2. Recalling 
that gi = fm we see that whenever m > rii we have Hf? — /m|| < + Ibt^/mll < 

e/2 + e/2 = e. Hence lim \\g - = 0. □ 



5.3 Hilbert Space 

In any Hilbert space we can, of course, talk about convergent sequences and series. 
The meaning is precisely what you would expect. In particular, if 7i is a Hilbert 
space and {xn} is a sequence, then 

lim Xn = X 

means that for any e > there is an > such that ||a; — x^H < e whenever n > N. 
This is exactly the usual definition in M except we use the norm || || in place of 
absolute value. Also if is a sequence in TC, then 

oo 

y^^Um = S 

m=l 

means lim s„ = s where 

n 
m=l 

We will say a series Ylm=i converges absolutely provided Ylm=i W'^mW converges. 
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Proposition 5.3.1. // a series in a Hilbert space converges absolutely then it con- 
verges. 

Proof. Given e > there is an > sucli tliat whenever n > m > N, 

n oo 



oo 

||Mm|| < e. 
i=m 



Let Sn = Yl^=i '^i^ then — s^H < J2'i=m ll'^mll < ^- follows that {s„} is a Cauchy 
sequence. Hence it converges. □ 

We will also talk about perpendicularity in H. We say x,y are perpendicular 
(written x ± y) if (x, y) = 0. 

Theorem 5.3.2 (Pythagorean Theorem). If xi, X2, ■ ■ ■ Xn are mutually perpendicular 
elements of a Hilbert space, then 

i=l i=l 

Proof. Consider the case n = 2. If x -L y, then 

||a; + y||2 = (^x + y,x + y) = {x,x) + 2{x,y) + {y,y) = \\xf+ ||yf 
since (x, y) = 0. The general case follows by induction on n. □ 

Definition 5.3.3. If Ti. is a Hilbert space, a bounded linear functional on Ti. is a 

function L : ^ M such that for all v,w & Ti. and Ci,C2 G M, L{ciV + C2w) = 
CiL{u) + C2L{w) and such that there is a constant M satisfying \L{v)\ < M||f || for 
all V E H. 

The following result was proved in Proposition (10. 7. 31) . In the case of the Hilbert 
space L^[—l, 1] it is just the corollary to Holder's inequality, Corollary (I5.1.6p . 

Proposition 5.3.4 (Cauchy-Schwarz Inequality). If {H, ( , )) is a Hilbert space and 
v,w eH, then 

\{v,w)\ < \\v\\ \\w\\, 
with equality if and only if v and w are multiples of a single vector. 
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For any fixed x G 7i we may define L : Ti. —>■ hj L[v) = {v,x). Then L is a 
linear function and as a consequence of tlie Caucliy-Scfiwarz inequality it is bounded. 
Indeed < M||f || where M = \\x\\. Our next goal is to prove that these are the 

only bounded linear functions from Ti. to M. 

Lemma 5.3.5. Suppose Ti is a Hilbert space and L : Ti. —>■ M. is a bounded linear 
functional which is not identically 0. If V = L~^(l). Then there is a unique x 
such that 

\\x\\ = inf \\v\\. 

That is, there is a unique vector in V closest to 0. Moreover, the vector x is perpen- 
dicular to every element of L^^{0), i.e. ifvETi and L{v) = 0, then {x,v) = 0. 

Proof. We first observe that V is closed, i.e. that any convergent sequence in V has 
a limit in V. To see this suppose limx„ = x and x„ G V. Then \L{x) — L{xn)\ = 
\L{x — Xn)\ < M\\x — Xn\\ ioT some M. Hence since L{xn) = 1 for all n, we have 
\L{x) — 1| < limM||a; — = 0. Therefore L{x) = 1 and x G V. 

Now let d = inf ||f || and choose a sequence {xn}'i^=i in V such that lim = d. 

We will show that this sequence is Cauchy and hence converges. 

Notice that (x„ + Xm)/2 is in V so + Xm)/2|| > d or ||x„ + Xm\\ > 2d. By the 
parallelogram law (Proposition (10. 7. 41) 

ll^n ^m|| ~t~ ll^n ~t~ ^m|| 2||2^n|| . 

Hence 

As m and n tend to infinity the right side of this equation goes to 0. Hence the left 
side does also and lim ||a;„ — x^H = 0. That is, the sequence {xn}'^=i is Cauchy. Let 
X G V be limit limit of this sequence. Since < ||a; — + for all n, we have 

||a;|| < lim \\x — Xn\\ + lim = d. 

n— »oo n—too 

But X G V implies > c? so ||x|| = d. 

To see that x is unique suppose that y is another element of V and \\y\\ = d. Then 
(x + i/)/2 is in V so ||x + y\\ > 2d. Hence using the parallelogram law again 

||x - yf = 2\\xf + 2\\yf - ||x + yf < Ad^ - Ad^ = 0. 
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We conclude that x = y. 

Suppose that v G L~^(0). We wish to show it is perpendicular to x. Note that for 
alH G M the vector x + tv E L~^{1) so ||x + |p > Hxp. Hence 

||a;||^ + 2t < x,v > +t^\\v\\^ > ||a;||^, so 

2t < x,v > +t^||f P > for all t eR. This is possible only ii < x,v >= 0. □ 

In the following theorem we characterize all the bounded linear functionals on a 
Hilbert space. Each of them is obtained by taking the inner product with some fixed 
vector. 

Theorem 5.3.6. IfTi. is a Hilbert space and L : Ti ^ M. is a bounded linear functional, 
then there is a unique x ^Ti such that L{v) = {v,x). 

Proof. If L{v) = for all v, then a; = has the property we want, so suppose L 
is not identically 0. Let xq G 7i be the unique point in L"^(l) with smallest norm, 
guaranteed by Lemma f l5.3.5p . 

Suppose first that v E H and L{v) = 1 Then L{v — Xo) = L{y) — L{xq) = 1 — 1=0 
so by Lemma fl5.3.5p {v — Xo,Xo) = 0. It follows that the vector x = xq/WxqW^ is also 
perpendicular to f — xq so 

{v, x) = {v, T-^) = {v ~ Xo, + {xo, TT^) = 1 = L{v). 

IFoII IfoII IfoII 

Hence for any v with L{v) = 1 we have L{v) = (f,x). Also for any v with L{v) = 
we have L{v) = = {v, x) by Lemma fl5.3.5p . 

Finally for an arbitrary w ETi with L[w) = c 7^ we define v = w/c so L{v) = 
L{w)/c = 1. Hence 

L{w) = L{cv) = cL{y) = c{v,x) = {cv,x) = {w,x). 

To see that x is unique, suppose that y eH has the same properties then for every 
V E H we have {v, x) = L{v) = {v, y). Thus {v,x — y) = for all v and in particular 
for t> = X — y. We conclude that ||x — yW^ = {x — y,x — y) = so x = y. 

□ 

5.4 Fourier Series 

Definition 5.4.1. A family of vectors {un} in a Hilbert spaceTi is ca//ec? orthonormal 

provided for each n, = 1 and {un,Um) = if n ^ m. 



5.4. FOURIER SERIES 



91 



Theorem 5.4.2. The family of functions T = {;^, cos(n7ra;), sin(n7ra;)}^;^ is an 
orthonormal family in L^[— 1, 1]. 

For a proof see Chapter 1 of [3j. 

Theorem 5.4.3. // {Mn}^=o finite orthonormal family of vectors in a Hilbert 
space Ti and w ^Ti, then the minimum value of 

N 

W - ^ CnUn 
n=0 

for all choices of Cn G M occurs when Cn = {w,Un). 
Proof. Let c„ be arbitrary real numbers and define a„ = {w, Un) Let 

N N 
U = ttnUn, and V = CnUn- 

n=0 n=0 

Notice that by Theorem fl5.3.2p {u,u) = X]^=o'^n ("^i'^) = J2n=o^n- ^Iso 



N 



N 



{W, v) Cn{w, Un) = ^ a„C„ 



n=0 



n=0 



Hence 



\w — f|p = {w — v,w — v) 



— 2{w, v) + ||f 

N N 

2 ^ a„c„ + ^cl 



\w\\ — 



n=0 



?i=0 



N 



\W\ 



n=0 



It foUows tliat 



\W\\ — M 



n=0 

TV 



n=0 



\U\ 



for any choices of the c^'s and we have equahty if only if c„ = a„ = {w, Un). That is, 
for all choices of f , the minimum value of ||iy — f |p occurs precisely when v = u. □ 
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Definition 5.4.4. //{m„}^q is an orthonormal family of vectors in a Hilbert space 
Ti, it is called complete if every w Eli. can be written as an infinite series 



W = ^ CnUn 
n=0 

for some choice of the numbers Cn G M. 

Theorem (15.4.31) suggests that the only reasonable choice for c„ is c„ = {w,Un) 
and we will show that this is the case. These numbers are sufficiently frequently used 
that they have a name. 

Definition 5.4.5 (Fourier Series). The Fourier coefficients of w with respect to an 
orthonormal family {"Unj^o '^'^^ numbers {w,Uri)- The infinite series 

OD 

n=0 

is called the Fourier series. 

Example 5.4.6 (Classical Fourier Series). We will show later that the orthnormal 
family of functions JF = {-^, cos(n7ra;), sin(?T,7rx), is complete. If / G L^[—l, 1], 
then the Fourier coefficients are 



An = J f cos{nnx) dfi for n > 
Bn = I f sin(n7ra;) d/i for n > , 



and the Fourier series is 

^ oo oo 

-py4o + An cos(?7,7rx) + i?„ sin(n7ra;) 



x/2 



n=l n=l 



Theorem 5.4.7 (Bessel's Inequality). If {ui}^^ is an orthonormal family of vectors 
in a Hilbert space Ti and w E Ti, then the series 

OD 

i=0 

In particular this series converges. 
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Proof. Let Sn be the partial sum for the Fourier series. That is, s„, = ^"=o('"^' '^n)un- 
Then since the family is orthogonal, we know by Theorem fl5.3.2p that 

n n 
Pnf = X] \\{w,Ui)Ui\\'^ = ^{w,Ui)'^. (5.4.1) 

1=0 2=0 

This implies that s„ ± (w — s„) because 

n 

{W -Sn,Sn) = {w,Sn) - (s„, S„) = - \\s„,f = 0. 

i=Q 

Since s„ ± (w — s„) we know 

= + _ s„||2 (5.4.2) 



by Theorem (15.3.21) again. Hence by equation (15.4. ip '^"^oiw , Un)'^ = < 
Since is an increasing sequence it follows that the series 



\wP. 



E{w,Un)'^ = lim \\Snf < 
n. — >cyri 



i=0 

converges. □ 

Corollary 5.4.8. //{m„}^q is an orthonormal family of vectors in a Hilbert space H 
and w eH., then the Fourier series Yliloi'^y'^d'^i '^^^^ respect to {wij^o converges. 

Proof. Let s„ be the partial sum for the Fourier series. That is, s„ = XliLo^"^' 
So if n > m, s„ - = ^,=^+1 {w, Ui)ui. 

Then since the family is orthogonal, we know by Theorem (I5.3.2p that 



Sri. S 



n ■^•m II 

i=m+l i=m+l 



Since the series ^^q^""^' '"*)^ converges we conclude that given e > there is an > 
such that II s„ — s^lP < whenever n,m > N. In other words the sequence {sn} is 
Cauchy. □ 

If Bessel's inequality is actually an equality, then the Fourier series for w must 
converge to w in Tt. 
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Theorem 5.4.9 (Parseval's Theorem). //{m„}^q is an orthonormal family of vectors 
in a Hilbert space Ti and w E Ti., then 

oo 
i=0 

if and only if the Fourier series with respect to {m„}5^q converges to w, i.e. 

oo 

^{w,Ui)Ui = W. 
1=0 

Proof. As above let s„ be the partial sum for the Fourier series. We showed in 
equation (I5.4.2p that \\w\\^ = + — s„||^. Clearly, then, lim||w — = if and 
only if lim||s„|p = Equivalently (using equation fl5.4.ip ) Yl^=o{'^i'^n)un = w 

if and only if Yl'^=o('^ ^ '^n)'^ ~ ll""^!!^- '-' 

Recall that an algebra of functions is a vector space A of real valued functions 
with the additional property that if f,g E A, then f g E A. If X = [a, b] is a closed 
interval in M we will denote by C{X) the algebra of all continuous functions from X 
to M and by Cend{X) = {/ | /(a) = /(&)}, the subalgebra of functions which agree 
at the endpoints. The following theorem is a special case of a much more general 
theorem called the Stone- Weierstrass Theorem. 

Theorem 5.4.10. Suppose that X = [—1, 1] and A C Cend{X) is an algebra satisfying 

1. The constant function 1 is in A, and 

2. A separates points (except endpoints): for any distinct x,y E X with {x,y} ^ 
{ — 1, 1} there is f E A such that f{x) ^ f{y)- 

Then A is dense in C{X), i.e. given any e > and any g E C{X) there is f E A 
such that \ f{x) — g{x)\ < e for all x E X. 

A proof can be found in 5.8.2 of [2J or in [4|. This result is usually stated in 
greater generality than we do here. For example the set X need only be a compact 
metric space, but since we have not defined these concepts we state only the special 
case above. 

Corollary 5.4.11. If e > and g : [—1, 1] M is a continuous function satisfying 
g{—l) = g{^), then there are an,bn E M such that \g{x) — p{x)\ < e, for all x, where 

N N 

p{x) = + a„ cos(n7rx) + 6„ sin(n7ra;). 

n=l n=l 
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Proof. Let X be the unit circle in the plane M?, i.e. X = {(cos(7ra;), sin(7ra;)) | x G 
[—1,1]}. So if : [—1,1] — > is given by = (cos(7ra;), sin(7ra;)), then X = 

0([-l, 1]). For any function / : [-1, 1] M, with /(-I) = /(I) we define f : X R 
to be the continuous function such that f{(f){x)) = f{x). We need the fact that 
/(—I) = /(I) because 0(— 1) = 0(1)- Conversely given any function h G C{X) we 
can define h : [—1, 1] ^ M by = h{(j){x)) and we will have h{—l) = h{l). 
Let A be the collection of all functions on [—1, 1] of the form 

N N 

q{x) = ao + fln cos(n7ra;) + 6„ sin(n7ra;). 

n=l n=l 

for some choices of A^, a„, and 6,1. Then ^ is a vector space and contains the constant 
function 1. It is an algebra as a consequence of the trigonometric identities 

sin(a;) cos(y) = 2 ( + + sin(a; — y)) 
cos(x) cos{y) = 2 ( + + cos(x — y)) 
sin(x) sin(?/) = 2 ( cos(x + — cos(x — y)) 

It is also the case that A separates points with the exception of the one pair of 
points X = —1, y = 1. To see this note that if x and y are not this pair and if one 
is positive and one negative, then sin('7rx) 7^ sin('7r?/). On the other hand if both are 
> or both < 0, then cos(7rx) 7^ cos(7r?/). 

It follows that if ^ = {g I g G A}, then A is an algebra which separates points 
of X and contains the constaint function 1. Note that the points x = —1, y = 1 
correspond to a single point of X, namely (—1, 0) = 0(— 1) = 0(1)- So they cause no 
problem. Thus A satisfies the hypothesis of the Stone-Weierstrass theorem. 

Hence given e > and (7 : [— 1, 1] — M, a continuous function satisfying (?(- 1) = 
giX), we consider g. By Stone-Weierstrass there is a p G ^ such that the value of p 
differs from the value of g by less than e for all points of X. Thus if p{x) = p{(j){x)) 
we have \g{x) — p{x)\ = \g{(l){x)) — p((/)(a;))| < e and p E A. □ 

Theorem 5.4.12. If f & L^[— 1,1], then the Fourier series for f with respect to 
the orthonormal family T converges to f in L^[— 1, 1]. In particular the orthonormal 
family T is complete. 

Proof. Given e > 0, we know by Proposition fl5.2.3p there is a continuous function 
g G L^[-l, 1] such that ^(-1) = ^(1) and ||/ - g\\ < e/2. 
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By the corollary to the Stone- Weierstrass theorem there is a function 




n=l n=l 



with \g{x) 



p{x)\ < e/4 for all x. So 



g-p 



TV 



1 



jig-pf d^^<-j-d^^=- 



Hence, ||/ 
Let 



p\\ < \\f -g\\ + \\g-p\\ <e/2 + e/V8< e. 




where An and Bn are the Fourier coefficients for / with respect to JF. Then Sn{x) 
is the partial sum of the Fourier series of /. According to Theorem fl5.4.3l) for every 
m> N, 11/ - S,m\\ < 11/ -p\\ so 11/ - SrnW < c. This proves lim||/ - 5^11 =0. □ 

Exercise 5.4.13. Suppose X = [—1, 1]. 

1. Prove that Cend{X) is a subalgebra of C{X), i.e. it is a vector subspace closed 
under multiplication. 

2. Let Ap be the polynomials 



Appendix A 
Lebesgue Measure 



A.l Introduction 

We want to define a generalization of lengtli called measure for bounded subsets of 
the real line or subsets of the interval [a, 6]. There are several properties which we 
want it to have. For each bounded subset A of M we would like to be able to assign 
a non-negative real number jii^A) that satisfies the following: 

I. Length. \i A= (a, h) or [a, 6], then ^{A) = len{A) = b — a, i.e., the measure of an 

open or closed interval is its length 

II. Translation Invariance. If A C R is a bounded subset of M and c e M, then 

IJ,{A + c) = IJ,{A), where A + c denotes the set (x + c | x e A}. 

III. Countable Additivity. If { A„}^]^ is a countable collection of bounded subsets 

of R, then 

oo oo 

l^{\jAn)<J2l^(^n) 

n=l n=l 

and if the sets are pairwise disjoint, then 



n=l n=l 



Note the same conclusion applies to finite collections {^n}^=i of bounded sets 
(just let /Ij = for i > m). 
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IV. Monotonicity If A G B, then fJ^{A) < ji^B). Actually, this property is a conse- 
quence of additivity since A and B\A are disjoint and their union is B. 

It turns out that it is not possible to find a ^ which satisfies I-IV and which is 
defined for all bounded subsets of the reals. But we can do it for a very large collection 
including the open sets and the closed sets. 

A. 2 Outer Measure 

We first describe the notion of "outer measure" which comes close to what wc want. 
It is defined for all bounded sets of the reals and satisfies properties I and II above. 
It also satisfies the inequality part of the additivity condition, III, which is called 
subadditivity. But it fails to be additive for some choices of disjoint sets. The reso- 
lution of this difficulty will be to restrict its definition to a certain large collection of 
nice sets (called measurable) on which the additivity condition holds. Our task is to 
develop the definition of measurable set, to define the notion of Lebesgue measure for 
such a set and, then to prove that properties I-IV hold, if we restrict our attention to 
measurable sets. 

Suppose A C R is a bounded set and is a countable covering of A by open 
intervals, i.e. A C Un where [/„ = {an,bn)- Then if we were able to define a 
function /i satisfying the properties I-IV above we would expect that 



and hence that fJ^{A) is less than or equal to the infimum of all such sums where we 
consider all possible coverings of A by a countable collection of open intervals. This 
turns out to be a very useful definition. 

Definition A. 2.1 (Lebesgue Outer Measure). Suppose A G IS. is a bounded set and 
U{A) is the collection of all countable coverings of A by open intervals. We define 
the Lebesgue outer measure fJ^*{A) by 



oo 



oo 



oo 



n=l n=l n=l 



oo 




where the infimum is taken over all possible countable coverings of A by open intervals. 
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Notice that this definition together with the definition of a null set, Definition fl2.2.1l) . 
says that a set A C / is a null set if and only if fi*{A) = 0. 

We can immediately show that property I, the length property, holds for Lebesgue 
outer measure. 

Proposition A. 2. 2. For any a, 6 G M with a < b we have ^*{[a,b]) = fi*{{a,b)) = 
b — a. 

Proof. First consider the closed interval [a,b]. It is covered by the single interval 
Ui = (a — e, 6 + e) so fi*{[a,b]) < len([/i) = b — a + 2e. Since 2e is arbitrary we 
conclude that /i*([a, 6]) < b — a. 

On the other hand by the Heine-Borel Theorem any open covering of [a, b] has 
a finite subcovering so it suffices to prove that for any finite cover {f/j}"^]^ we have 
^ len(?7j) > 6 — a as this will imply /x*([a, b]) > b — a. We prove this by induction on 
n the number of elements in the cover by open intervals. Clearly the result holds if 
n = 1. If 72 > 1 we note that two of the open intervals must intersect. This is because 
one of the intervals (say {c,d)) contains b and another interval contains c and hence 
these two intersect. By renumbering the intervals we can assume that f/„,_i and f/„ 
intersect. 

Now define Vn-i = Un-i U and = f/j for i < n — 1. Then {Vi} is an open 
cover of [a, b] containing n — 1 intervals. By the induction hypothesis 

^leniVi) >b-a. 

i=l 

But len([/„_i) + len([/„) > len(y„_i) and len(f/j) = len(Vi_i) for i > 2. Hence 

n n—l 

J2 len(t/i) > J2 ^^^i^i) >b-a. 

1=1 i=l 

This completes the proof that /i*([a, 6]) > b — a and hence that /i*([a, 6]) = b — a. 

For the open interval (a, b) we note that U = (a, b) covers itself so n*{{a, b)) < b—a. 
On the other hand any cover {U.i}'^!^ of (a, b) by open intervals is also a cover of the 
closed interval [a + e, 6 — e] so, as we just showed, 

oo 

^ \en{Ui) >b-a-2e. 

i=l 



As e is arbitrary ^ len(f/j) > b — a and hence fj^*{{a, b)) >b — a which completes our 
proof. □ 
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Two special cases are worthy of note: 

Corollary A. 2. 3. The outer measure of a set consisting of single point is 0. The 
outer measure of the empty set is also 0. 

Lebesgue outer measure satisfies a monotonicity property with respect to inclu- 
sion. 

Proposition A. 2. 4. If A and B are bounded subsets o/R and A C B then Ijl*{A) < 
IX* {B). 

Proof. Since A C -B, every countable cover {[/„} G U{B) of B by open intervals is 
also in U {A) since it also covers A. Thus 

oo oo 

inf |yien(t/„))< inf |yien(t/„)), 

n=l n=l 

so ^*{A) < n*{B). □ 

We can now prove the first part of the countable additivity property we want. 
It turns out that this is the best we can do if we want our measure defined on all 
bounded sets. Note that the following result is stated in terms of a countably infinite 
collection of sets, but it is perfectly valid for a finite collection also. 

Theorem A. 2. 5 (Countable Subadditivity) . // {^njJJLi is a countable collection of 
bounded subsets ofW, then 

oo oo 
n=l n=l 

Proof. By the definition of outer measure we know that each An has a countable cover 
by open intervals {^7"} such that 

oo 
i=l 

But the union of all these covers {C/"} is a countable cover of U^i ^n- So 

oo oo oo oo oo oo 

n=l n=l i=l n=l n=l n=l 

Since this is true for every e the result follows. The result for a finite collection 
{74^}^^^ follows from this by letting = for i > m. □ 
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Corollary A. 2. 6. If A is countable, then yU*(v4) = 

Proof. Suppose A = IJi^il^i}- We saw in Corollary (IA.2.3P that ^*{{xi}) = so 

oo oo 
/x*(A)=/i*(U{x,})< J]^*({X,}) = 0. 

i=l i=l 

which implies fi*{A) = 0. □ 

Since countable sets have outer measure and fi*{[a,b]) = 6 — a we also immedi- 
ately obtain the following non-trivial result (cf. part 4. of Exercise (I0.3.13I) ). 

Corollary A. 2. 7. If a < b, then [a,b] is not countable. 

Outer Lebesgue measure satisfies property 11 of those we enumerated at the be- 
ginning, namely it is translation invariant. 

Theorem A. 2. 8. //c G M and A is a bounded subset o/R, then fi*{A) = n*{A + c) 
where A + c = {x + c \ x & A}. 

We leave the (easy) proof as an exercise. 

Exercise A. 2. 9. 

1. Prove Theorem flA.2.81) . 

2. Prove that given e > there exist a countable collection of open intervals 
Ui,U2, ■ ■ . ,Un, ■ ■ ■ such that |J,^ f/„ contains all rational numbers in M and such 
that Y^'^^i len(t/„) = e. 

3. Give an example of a subset A of I such that fi*{A) = 0, but with the property 
that if Ui, U2, . . . , f/n is a finite cover by open intervals, then Y17=i l6n(f/i) > 1. 



A. 3 Lebesgue Measurable Sets 

In Definition (12.4. ip we defined the cr-algebra to be the cr-algebra of subsets of 
M generated by open intervals and null sets (it is also the cx-algebra of subsets of M 
generated by Borel sets and null sets). We defined a set to be Lebesgue measurable 
if it is in this a-algebra. However now, in order to prove the existence of Lebesgue 
measure, we want to use a different, but equivalent definition. 
Our program is roughly as follows: 
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• We will define a collection Aio of subsets of I The criterion used to define Aio 
is often given as the definition of Lebesgue measurable sets. 

• We will define the Lebesgue measure f^{A) of a set A in Aio to be the outer 
measure of A. 

• We will show that the collection Ai^ is a cr-algebra of subsets of / and in fact 
precisely the cr-algebra so we have defined fi{A) for all A G 

• We will prove that fi defined in this way satisfies the properties promised in 
Chapter [21 namely properties I-V of Theorem (12.4.21) . Several of these properties 
follow from the corresponding properties for outer measure /i*, which we proved 
in Section f[0) . 

Henceforth for definiteness we will consider subsets of the unit interval / = [0, 1]. 
We could, of course, use any other closed interval or even, with some extra work, 
the whole real line. Lebesgue outer measure as in Definition flA.2.1|) has most of 
the properties we want. There is one serious problem, however; namely, there exist 
subsets A and B of / such that An5 = and v4U5 = / but /i*(v4) ^ 
That is, the additivity property fails even with two sets whose union is an interval. 

Fortunately, the sets for which it fails are rather exotic and not too frequently 
encountered. Our strategy is to restrict our attention to only certain subsets of 
/ which we will call "measurable" and to show that on these sets /i* has all the 
properties we want. 

If y4 C / we will denote the complement of A by A'^, that is, 

A"" = I\A = {x e I \ X ^ A}. 

Definition A. 3.1 (Alternate Definition of Lebesgue Measurable). Let A4o denote 
the collection of all subsets of I defined as follows: A subset A of I is in Aio provided 
for any subset X G I 

IJ*{A nX)+ i2*{A' nX) = ij*{X). 
For any set A G Aio we define fi{A) to be fi*{A). 

The goal of the remainder of this section is to prove that in fact A^o is nothing 
other than the a-algebra Ai (/) of Lebesgue measurable subsets of / and the function 
: A4q — > M satisfies the properties for Lebesgue measure we claimed in Chapter [2j 
The defining condition above for a set A to be in AIq is often taken as the definition of 
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a Lebesgue measurable subset of / because it is what is needed to prove the properties 
we want for Lebesgue measure. Since we have aheady given a different definition of 
Lebesgue measurable sets in Definition (12.4.11) we will instead prove the properties of 
Aio and fi which we want and, then show Aio = -Mil) so the two definitions coincide. 
Indeed, we will prove in Corollary flA.3.9p that the sets in Aio are precisely the sets 
in the a-algebra generated by Borel subsets and null subsets of /. 

As a first step we show that Aio C -Mil)- 

Proposition A. 3. 2. Every set A G Aio can be written as 

A = B\N 

where B is the intersection of a countable nested family of open sets (and, in partic- 
ular, is a Borel set) and N = A'^ r\B is a null set. It follows that M.^ C M.{I)- 

Proof. Since ^{A) = fi*{A) it follows from the definition of outer measure that for 
any e > there is a cover of A by open intervals [/„ such that 

oo oo 

Y,f^*iUn) = J2len{Un) < fi*{A) + e. 

n=l 71=1 

The monotonicity of outer measure, then implies that if K = W^^iUn we have < 
fi*{V,) < + e. If we let 

k 

Wk = []Vi 

i=l 

, then each {Wk} is a nested family of open sets and ^*{A) < fi*{Wk) < + l/k. 

open. 

Let B = n^^VFfe. By monotonicity again we have 

^,*{A)<f,*{B)<^,*{Vk)<^I*{A) + ^. 

Since this holds for all /c > we conclude fJ.*{B) = fi*{A). 

In the defining equation of A4q (see Definition (IA.3.11) we take X = B and conclude 

/i*(A n 5) + i2*{A' nB)= fi*{B). 

Since A C B we have AnB = A and hence fi*{A) + fi*{A'' n B) = fi*{B). From the 
fact that /i*(5) = fi*{A) it follows that /i*(v4^ n B) = 0. Therefore if N = A^n B, 
then is a null set. Finally A = B \ {B n A") so A = B \ N. □ 
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The definition of tMq is relatively simple, but to show it has the properties we 
want requires some work. If we were to replace the = sign in this definition with > 
we would obtain a statement which is true for all subsets of /. So to prove a set is in 
M.0 we need only check the reverse inequality. More precisely, 

Proposition A. 3. 3. Suppose A G I, then 

(1) The set A is in M.^ provided for any subset X G I 

^i*{A n X) + n X) < ij*{X). 

(2) The set A is in Aio if and only if A'^ is in M.^. In this case /i(A^) = 1 — ^{A). 

Proof. For part (1) observe X = {Ar\ X) U {A^ fl X) so the subadditivity property of 
outer measure in Theorem ( IA.2.51) tells us that 

n X) + n X) > /i*(x) 

always holds. This plus the inequality of our hypothesis gives the equality of the 
definition of Alo- 

For part (2) suppose A is an arbitrary subset of /. The fact that [A^Y = ^ implies 
immediately from Definition (lA.3.ip that A is in A^q if ^^nd only if A'^ is. Also taking 
X = J in this definition we conclude 

^*{A n J) + n J) = ^*(J) = 1. 

So = 1. □ 

Proposition A. 3. 4. A set A G I is a null set if and only if A G A4o O'^d ^{A) = 0. 

Proof. By definition a set A is a null set if and only if fi*{A) = 0. If A is a null set, then 
since ACiX G A we know by the monotonicity of outer measure (Proposition ( IA.2.41) ) 
that n X) = 0. Similarly, since n X C X we know that fi*{A'' n X) < /i*(X). 
Hence again using monotonicity of outer measure from Proposition (1A.2.4P we know 
that 

fx*{A n X) + n X) = fi*{A^ n X) < /i*(x) 

and the fact that A G Aio follows from part (1) Proposition ( ]A.3.3[ □ 
Proposition A. 3. 5. If A and B are in M.q, then AVJ B and APi B are in M.q. 
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Proof. To prove if two sets, A and B, then there union is in Aio requires some work. 
Suppose X C I. And since {AU B) n X = {B n X) U {An B" n X), the subadditivity 
of Theorem flA.2.51) tells us 

u 5) n X) < n X) + i2*{A nB'n x). (A.3.1) 

Also the definition of A^o tells us 

fi*{B^ nx) = fi*{AnB^nx) + fi*{A^ nB^n x). (A.3.2) 

Notice that {A U By = A^ n B^. So we get 

u 5) n X) + u BY n x) 
= ^*{{A u 5) n X) + n 5^ n X) 
< n X) + n n X) + n 5" n x) 
= /i*(5nx) + /i*(5'^nx) 

= /i*(X). 

According to part (1) of Proposition flA.3.31) this implies that A U -B is in Ai^. 

The intersection now follows easily using what we know about the union and 
complement. More precisely, Ar\ B = {A'^ U B'^Y so if A and B are in TMoj then so 
is {A'^ U B^) and hence its complement (A^ U B'^Y is also. □ 

Next we wish to show intervals are in A^o- 

Proposition A. 3. 6. Any subinterval of I , open, closed or half open, in Aio- 

Proof. First consider [0,a] with complement (a, 1]. If X is an arbitrary subset of / 
we must show /i*([0,a] n X) + /i*((a, 1] n X) = /i*(X). Let X" = [0,a] n X and 
X"*" = (a, 1] n X. Given e > 0, the definition of outer measure tells us we can find a 
countable cover of X by open intervals {Un}'^=i such that 

oo 

J2 len(f/n) < /x*(X) + e. (A.3.3) 

n=l 

Let t/^ = f/„ n [0,a] and f/+ = t/„ n (a,l]. Then X~ C U^^i f/^ and X+ C 
U^i U^- Subadditivity of outer measure imphes 



by equation ( ]A.3.1ll . 
by equation (IA.3.2p . 



oo oo 

/.*(X-)</.*(Ut/-)<^len([/-) 

n=l n=l 
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and 

oo oo 

/z*(X+)<^*(Uf/+)<^len([/+). 

n=l n=l 

Adding these inequalities and using equation (1A.3.3P we get 



/x*(X-) +^*(X+) < ^len(f/-) + len(f/+ 

n=l 

oo 

= ^len([/„) 



n=l 

</i*(X) + e. 

Since e is arbitrary we conclude that /i*(X^)+/i*(X+) < fi*{X) which by Proposition 
(1A.3.3P implies that [0, a] is in A^o for any < a < 1. A similar argument implies 
that [a, 1] is in Aio. Taking complements, unions and intersections it is clear that 
any interval, open closed or half open, is in A^o- D 

Lemma A. 3. 7. Suppose A and B are disjoint sets in M.q and X <Z I is arbitrary. 
Then 

u B) n X) = n X) + iJ*{B n x). 

The analogous result for a finite union of disjoint measurable sets is also valid. 
Proof. It is always true that 

A n (A u fi) n X = A n X. 

Since A and B are disjoint 

A' n (A u 5) n X = 5 n X. 

Hence the fact that A is in A^o tells us 

n*{{A u 5) n X) = n (A u 5) n X) + n*{A^ n (A u 5) n x) 
= /i*(Anx) + /i*(5nx). 

The result for a finite collection Ai, A2, . . . , An follows immediately by induction 
on n. □ 

Theorem A. 3. 8. The collection Aio of subsets of I is closed under countable unions 
and countable intersections. Hence Aio is a a-algebra. 
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Proof. We have already shown that the complement of a set in A4.Q is a set in A^o- 

We have also shown that the union or intersection of a finite collection of sets in 
Aio is a set in A^o- 

Suppose {An}'^:^i is a countable collection of sets in A4o. We want to construct a 
countable collection of pairwise disjoint sets {-Bnj^i which are in A^q aiid have the 
same union. 

To do this we define Bi = Ai and 

n n 
Bn+l = An+l \ U A„ = An+l H ( |J A^J . 

i=l i=l 

Since finite unions, intersections and complements of sets in M.q are sets in AIq, 
it is clear that Bn is measurable. Also it follows easily by induction that IJILi ~ 
U"^^ Ai for any n. Thus U^^^ Bi = Ai 

Hence to prove IJi^i ^« in A4q we will prove that IJi^i -^o- Let F„ = 

Ur=i ^ ~ Ui^i ^i- If is an arbitrary subset of J, then since F„ is in Aio 

/i*(x) = /x*(F„nx) + /x*(F„^nx) >/x*(F„nx) + /i*(F'=nx) 

since F'^ C F^. By Lemma fOOTD 

n 

/i*(F„nx) = ^^*(5,nx). 

j=i 

Putting these together we have 

n 

fi*{x) > ^ fi*{Bi n X) + n X) 

i=l 

for all n > 0. Hence 

oo 

fi*{x) > J2f^*iBinx) + fi*{F'nx). 

i=l 

But subadditivity of /i* implies 



oo oo 

n X) > /i*(|J(fi. n x)) = n x). 

i=l j=l 
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Hence 

ij,*{x) > fi*{F n X)) + fi*{F^ n X) 

and F is in Mq by Proposition (]A.3.3p . 

To see that a countable intersection of sets in A^q is in TWq we observe that 

oo oo 

n^"=(u^^ 

n=l n=l 

SO the desired result follows from the result on unions together with the fact that A^o 
is closed under taking complements. □ 

Corollary A. 3. 9. The a-algebra Aio of subsets of I equals the a-algebra of 

subsets of I generated by Borel sets and null sets. 

Proof. The cr-algebra AAq contains open intervals and closed intervals in / by Propo- 
sition (1A.3.6P and hence contains the a-algebra they generate, the Borel subsets of J. 
Also A^o contains null sets by Proposition (lA.3.4p . Therefore M.q contains Ai(/) the 
cr-algebra A^o generated by Borel sets and null sets. 

On the other hand by Proposition flA.3.2p Mq<Z M{I). Hence Mq = M{I). □ 

Since we now know the sets in Aio? i-e. the sets which satisfy Definition ( IA.3.11) . 
coincide with the sets in Ai[/], we will refer to them sets as Lebesgue measurable sets, 
or simply measurable sets for short. We also no longer need to use outer measure, 
but can refer to the Lebesgue measure fi{A) of a measurable set A (which, of course, 
has the same value as the outer measure fi*{A)). 

Theorem A. 3. 10 (Countable Additivity). // is a countable collection of 

measurable subsets of I , then 



oo 



n=l n=l 

If the sets are pairwise disjoint, then 



n=l n=l 



The same equality and inequality are valid for a finite collection of measurable subsets 

{^n}™=l- 
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Proof. The first inequality is simply a special case of the subadditivity from Theorem 
flA.2.5|) . If the sets Ai are pairwise disjoint, then by Lemma flA.3.71) we know that for 
each n 

oo n n 

^,{[jA,)>f,{[jA,) = J2fi{A,). 

i=l 1=1 1=1 

Hence 

oo oo 
1=1 i=l 

Since the reverse inequality follows from subadditivity we have equality. □ 

We can now prove the main result of this Appendix, which was presented as 
Theorem fl^X^ in Chapter 2. 

Theorem A. 3. 11 (Existence of Lebesgue Measure). There exists a unique function 
fi, called Lebesgue measure, from to the non-negative real numbers satisfying: 

I. Length. If A = {a,b) then /u(A) = len(y4) = b — a, i.e. the measure of an open 

interval is its length 

II. Translation Invariance. Suppose A G I, c G M and A + c G I where A + c 

denotes the set {x + c \ x E A}. Then fi{A + c) = fi{A) 

III. Countable Additivity. // {v4„}^]^ is a countable collection of subsets of I, 

then 

oo oo 

^^{\JA^)<J2K^n) 

n=l n=l 

and if the sets are pairwise disjoint, then 

oo oo 
n=l n=l 

IV. Monotonicity If A, B e M{I) and A C B then fi{A) < ^{B) 

V. Null Sets A subset A G I is a null set set if and only if A G M.{I) and yu(v4) = 0. 

Proof. The Lebesgue measure f^{A) of any set A G M{I) is defined to be its outer 
measure fi*{A). Hence properties I, II, and IV for fi follow from the corresponding 
properties of fi*. These were established in Propostion flA.2.2p . Theorem flA.2.8p . and 
Proposition flA.2.4p respectively. 
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Property III, countable additivity, was proved in Theorem (]A.3.10I) . And Property 
V is a consequence of Proposition flA.3.41) . 

We are left with the task of showing that /i is unique. Suppose fii and fi2 are two 
functions defined on and satisfying properties I-V. They must agree on any 

open interval by property I. By Theorem flO.4.31) any open set is a countable union of 
pairwise disjoint open intervals, so countable additivity implies /ii and fi2 agree on 
open sets. 

Suppose that B is the intersection of a countable nested family of open sets Ui D 
U2 D ■ ■ ■ ^ Un ■ ■ ■ ■ Then Proposition (I2.4.6P implies 

00 00 

= n U„) = lim fXi{Un) = lim /i2(t/n) = /U2( Pi ^'^) = l^^iB). 

1 ' n^oo n^oo ^11 
n=l n=l 

Also if iV G M{I) is a null set, then /ii(A^) = = /X2(A^) by property V. 

Finally if A is an arbitrary set in Ai (/) by Proposition (]A.3.2I) A = B \ N where 
B is the intersection of a countable nested family of open sets and N = A"^ (1 B is a 
null set. Since B is the disjoint union of A and It follows that 

□ 



Appendix B 

A Non-measurable Set 



We are now prepared to prove the existence of a non-measurable set. The proof 
(necessarily) depends on the Axiom of Choice (see Section flU.3l) ) and is highly non- 
constructive. 

Lemma B.1.1. Let A be a measurable set with fi{A) > and let A = {xi — 

X2 I xi,X2 G A} be the set of differences of elements of A. Then for some e > 
the set A contains the interval (— e,e). 

Proof. By Theorem (12.5.11) there is an open interval U such that fi{Ar\U) > | \en{U). 
Let e = len(?7)/2, so len(f/) = 2e. Suppose y G (— e, e) and let U+y = {x+y \ x G U},, 
then U U {U + y) is an interval of length at most 3e. 

Now let B = AnU and B' = B + y. Then ii{B') = 12(B) > | len(f/) = |e so 
fi{B') + fi{B) > 3e. On the other hand the fact that B U B' C U U {U + y) implies 
fi{B U B') < 3e. It follows that B and B' cannot be disjoint since otherwise we would 
contradict additivity. 

If xi G -B n B', then xi = X2 + y for some X2 G B. Hence y = Xi — X2 & A. We 
have shown that any y G (— e, e) is in A. □ 

Theorem B.1.2 (Non-measurable Set). There exists a subset E of [0, 1] which is not 
Lebesgue measurable. 

Proof. Let Q C R denote the rational numbers. The rationals are an additive sub- 
group of M and we wish to consider the "cosets" of this subgroup. More precisely, we 
want to consider the sets of the form Q + x where x G M. 

We observe that two such sets Q + xi and Q + X2 are either equal or disjoint. This 
is because the existence of one point z G (Q + xi) fl (Q + X2) implies z = xi + ri = 
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X2 + r2 with ri,r2 G Q, so Xi — X2 = {r2 — Ti) G Q. This, in turn imphes that 
Q + a;2 = {a;2 + r I r e Q} = {a;2 + r + (a;i-a;2) I r G Q} = {xi+r I r e Q} = Q + xi. 

Using the Axiom of Choice we construct a set E which contains one element from 
each of the cosets Q + x, that is, for any a;o G M the set fl (Q + a;o) contains 
exactly one point. Now let {r^}^]^ be an enumeration of the rational numbers. We 
want to show that M = IJ^^i E + Vn- To see this let x G M be arbitrary and let 
{xq} = E n {Q + x). Then xq = x + r for some r G Q or x = Xq + where tq = —r. 
Hence xG-E — rsoxG IJ^^i ^ + ^n- We have shown M = IJ^i ^ + '^n- 

We now make the assumption that E is measurable and show this leads to a 
contradiction. We first note that if we define A = {xi — X2 | xi,X2 G E}, then A 
contains no rational points except 0. This is because xi = X2 + r for rational r would 
imply that fl (Q + X2) D {xi,X2} and this intersection contains only one point. 
Since A contains at most one rational point it cannot contain an open interval so by 
Lemma fIB.l.ip we must conclude that fi{E) = 0. 



But if we define Vn = {E + r„) n [0, 1], then < ^iE + r„) = = so 

fJ'{Vn) = 0. The fact that M = IJ^^^ E + r„ implies 

00 00 

[0,i] = |J((i? + rOn[0,i]) = |jK. 

n=l n=l 

Subadditivity, then would imply /i([0, 1]) < Yl'^=i /^(^) = which clearly contradicts 
our assumption that E is measurable. □ 
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